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ABSTRACT 

The  problem  addressed  in  this  thesis  is  the  maximization  of  the 
expected  matched  filter  receiver  response  to  a  signal  that  has  been 
transmitted  through  a  communication  channel  whose  average  scattering 
properties  are  known  in  terms  of  a  scattering  function.  This  is 
accomplished  by  altering  the  receiver  processing  signal  given  the  channel 
scattering  function  and  transmit  signal.  The  channel  is  assumed  to  be 
doubly-spread,  meaning  that  any  signal  propagated  through  it  will  exhibit 
both  time  and  frequency  spreading.  The  scattering  functions  that 
describe  these  channels  subtend  a  finite  region  in  the  delay-dopper 
plane. 

This  thesis  contains  some  of  the  background  material  necessary  to 
understand  the  modeling  of  communication  channels  as  random  linear 
time-varying  systems  and  the  use  of  matched  filter  receivers  for  signal 
detection.  This  material  includes  a  review  of  the  properties  of  linear 
spaces,  Fourier  transforms,  and  the  foundational  material  leading  to  the 
development  of  the  scattering  function. 
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CHAPTER  1 


GENERAL  INTRODUCTION 

In  active  signal  detection  systems,  detection  is  performed  by 
transmitting  a  signal  over  a  communication  channel,  processing  any 
received  signal  by  a  receiver,  measuring  the  receiver  output,  and 
comparing  the  output  to  a  predetermined  threshold.  The  optimum  receiver 
in  Gaussian  white  noise  is  known  to  be  a  matched  filter  receiver,  also 
known  as  a  correlation  receiver.  It  consists  of  a  multiplier  used  to 
form  the  product  between  the  received  signal  and  processing  signal,  and 
an  integrator  to  integrate  the  multiplier  output. 

Channel  scattering  having  a  delay  extent  greater  than  the  average 
wavelength  of  the  transmitted  signal  is  said  to  be  delay-spread.  On  the 
other  hand,  channels  whose  scattering  properties  vary  rapidly  in  a  time 
interval  on  the  order  of  the  signal  time  duration  are  said  to  be 
doppler-spread.  Any  communication  channel  exhibiting  a  combination  of 
delay  spreading  and  doppler  spreading  is  said  to  be  -loubly-spread . 
Examples  of  doubly-spread  sctterers  are  fish  schools,  volcanic  plumes, 
storm  cells,  rotating  planets,  and  asteroids. 

A  portion  of  the  transmitted  energy  can  also  be  scattered  by  the 
medium  and  is  referred  to  as  clutter  or  reverberation.  Both  clutter  and 
reverberation,  which  may  also  be  doubly-spread,  represent  interference 
and  degrade  the  ability  of  the  matched  filter  receiver  to  detect  the 
channel  output  signal. 

This  thesis  is  concerned  with  the  optimization  of  a  matched  filter 
receiver  to  detect  a  signal  that  has  been  transmitted  through  a 
doubly-spread  communication  channel  whose  scattering  properties  are 


h(t) 


a  (t) 


LINEAR 

TIME- INVAR!  ANT 
SYSTEM 


Figure  1.  Block  diagram  of  a  linear  time-varying  system. 


Figure  2.  Derivation  of  (2.3-1).  Input  x(t)  approximated  by 
a  pulse  train.  The  response  at  time  t  due  to  the 
portion  of  the  input  approximated  by  the  shaded 
panel. 
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then  interpreted  as  an  amplitude  density  function  for  the  complex 
exponentials  included  in  the  continuous  sum  (integral)  in  (2.2-26).  It 
is  for  this  reason  that  the  Fourier  transform  is  also  referred  to  as  the 
'spectrum'  of  the  function  x(t),  since  it  gives  a  measure  of  the  relative 
weighting  of  each  complex  exponential  as  a  component  of  the  function 
x(t).  The  terms  Fourier  transform  and  spectrum  will  be  used 
interchangeably  throughout  this  thesis. 

2. 3  Linear  System  Theory 

2.3.1  Linear  Time-Invariant  and  Time-Varying  Systems 

Linear  time-invariant  systems  are  often  described  by  a  function, 
h(t),  referred  to  as  the  'system  impulse  response  function.'  Its  name 
decribes  exactly  what  it  does:  it  expresses  the  system  output  y(t) 
when  the  system  input  x(t)  is  an  impulse  function  with  unity  weight. 

The  impulse  response  does  provide  another  service.  It  can  be  used  to 
determine  the  system  output  for  general  inputs,  and  this  is  expressed  by 
the  convolution  between  h(t)  and  the  input  x(t),  i.e., 

00 

y(t)  =  /  h(x)  x(t-r)  dx  .  (2.3-1) 

—  oo 

The  derivation  of  (2.3.1)  can  be  developed  in  the  time  domain  where 
the  integral  is  considered  to  be  a  limit  of  a  sum  of  responses  to  a 
square  pulse  train  that  approximates  the  input  x(t).  Figure  1  shows  the 
linear  time-invariant  system  as  a  block  diagram  and  illustrates  its 
interpretation  in  terms  of  the  impulse  response.  Figure  2  shows  how  the 
input  function  is  approximated  by  a  series  of  square  pulses  with  width 
4 x  and  height  x(t-x^).  Note  that  the  system  response  to  the  i-th  pulse, 
y^(t),  can  be  approximated  by  the  response  to  an  impulse  applied  at  time 
t  —  x ^  whose  weight  is  equal  to  x(t-x^)  dx,  therefore, 
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Theorem  2. 7  Let  x(t)  £  L^(R^)  and  be  bounded  (|x(t)|  <  00 )  for  all  t  £  R*-. 
Then  x(  t )  £  L2(R1). 

Proof  Define  Che  normalized  function  x  by 


sup  |x(t)| 


(2.2-23) 


Where  sup  |x(t)|  <  00  because  x(c)  is  bounded  by  hypothesis.  Then 
c 

|x(t)|  <  1  for  all  t  £  R*.  Moreover,  |x(t)j2  <_  |x(t)|  so 


U  x  II  ^  “  /  |x(t)|2  dt  <_  /  j  x  ( t )  |  dt  =  II  x  II  ^  < 


(2.2-24) 


Therefore,  x(t)  £  L2(r1)  and  x(t)  *  (sup  |x(t)|)  x(t)  £  L2(R^)  because 

t 

L2(Rl)  is  a  linear  space.  This  proves  Che  theorem. 

It  should  be  noted  that  the  converse  of  this  theorem  is  not  true 
because  there  exist  functions  in  both  L^(R^)  and  L2(Rl)  that  are  not 
bounded.  The  following  theorem  will  be  particularly  useful  later  when 
finite  energy  signals  are  discussed.  Its  proof  can  be  found  in  any  text 
on  functional  analysis. 

Theorem  2.8  (Schwartz  Inequality)  If  x(t),  y(t)  £  L2(R*)  then 


<x,y>|  <  U x II ^  II y II 2* 


(2.2-25) 


Equality  holds  if  and  only  if  y(t)  =  Xx(t)  or  y  =  0  where  X  is  a  real 


constant . 


2.2.3  The  Interpretation  of  the  Fourier  Transform 


In  engineering  analysis,  the  inverse  Fourier  transform,  given  by 


:(t)  =  /  X(f)  ej2*tC  df, 


(2.2-26) 


is  often  given  Che  interpretation  chat  it  expresses  the  function  x(t)  as 
a  continuous  sum  of  complex  exponential  functions.  The  function  X(  '  is 
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Theorem  2.5  Let  x(t)  £,  L^(r1)  and  define 

X(f,a)  =  /  x(t)  e“J27rft  dt.  (2.2-19) 

-a 

then  as  a  -►  X(f,a)  converges  in  the  mean  over  R'-  to  a  function  X(f)  in 
L^(r)).  Conversely, 

x(t,a)  =  J  X(f)  ej2lTft  df  (2.2-20) 

-a 

converges  in  the  mean  to  x(t)  almost  everywhere  over  R^  as  a  +  ". 

From  this  theorem  it  is  possible  to  prove  another. 

Theorem  2.6  The  Fourier  transform  (2.2-19)  defines  a  bounded  linear 
transform  of  L^(r1)  to  L^(R^)  which  is  norm  preserving,  i.e., 

“XU  2  =  it x I  2 •  (2.2-21) 

This  theorem  states  that  Fourier  transform  maps  l2(r!)  to  L^(r1),  and  so 
it  defines  an  isomorphism  of  L^(R^)  onto  itself.  The  symmetry  between 
any  element  of  L^(Rl)  and  its  Fourier  transform  is  not  an  unusual 
property  for  a  Hilbert  space.  For  example,  it  can  be  shown  that  in  a 
Hilbert  space  x  all  bounded  linear  functionals  are  of  the  form 

f(x)  =  <x,$>  (2.2-22) 

for  all  x  £  x  where  is  also  in  x  (a  Hilbert  space  is  its  own  algebraic 
dual ) ^ . 

Before  discussing  the  appliction  of  Fourier  transforms  and 
convolution  to  the  analysis  of  systems  in  section  2.3,  two  theorems  are 
stated  that  will  be  useful  later. 


By  Fubini's  theorem^* 6  it  follows  that 


//  | x( T )  y(t-i) |  dTdt 

—  OO 

CO  OO 

<  /  |x(t)|  dT  /  |y(t-t)|  dt 

—  ao  —oo 

=  lx II j  Uy tl A  .  (2.2-17) 

Therefore,  from  (2.2-16)  and  (2.2-17)  it  can  be  seen  that 

« x*y  1  i  <  8x11 1  lyl i  <  «  (2.2-18) 

and  so  x*y  £  L^(R^).  Furthermore,  since  a(x(t)  y(t-t))  -  (ooc(t))  y(t-x) 

=  x(x)  (ay(t-t))  for  all  a  £  R*-  this  establishes  L^(R^)  as  an  algebra 
under  convolution. 

Since  the  convolution  of  two  functions  in  L^(R^)  produces  another 
function  also  in  L^R1)  it  immediately  follows  that  this  function  is 
Fourier  transformable.  The  Fourier  transform  of  a  convolution  is  easily 
found  if  the  transforms  of  the  two  original  functions  are  known.  This  is 
shown  in  the  following  theorem. 

Theorem  2.4  Let  x(t),  y(t)  £  L^(Rl),  and  let  X(f)  and  Y( f)  be  the 
Fourier  transforms  of  x(t)  and  y(t).  Then  x*y  £  L*(R*)  and  its  transform 
is  X(f)  Y(f ). 

For  the  space  L^(Ri),  the  Fourier  transform  is  defined  to  be  a  limit 
as  shown  in  the  following  theorem  whose  proof  can  be  found  in  the 
references. ^*4 
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/  0(f)  df  =  1.  (2.2-12) 

—  CO 

Then  the  inverse  Fourier  transform  of  X(f)  is  §-summable  almost 
everywhere  to  x(t),  i.e., 

IZ  /  X(f)  ej2lTft  df  =  x(t)  a.e.  (2.2-13) 

P  _oo  P 

In  the  case  where  both  x(t)  and  X(f)  are  in  L^R1),  equation 
(2.2-11)  can  be  used  immediately.  This  is  stated  in  the  following 
theorem  whose  proof  can  be  found  in  the  references. 


Theorem  2.3  If  x(t)  and  X(f)  are  elements  of  L^(R^)  then 
00 

/  X(f)  ej2llft  df  *  x(t)  a.e.  (2.2-14) 

—  00 

Therefore,  x(t)  is  equal  almost  everywhere  to  a  function  in  I.l(Rl)  and 
cO(Rl).  If  x(t)  is  continuous  on  R* ,  then  inversion  formula  (2.2-11) 
holds  for  all  t  £  R^. 

The  space  L^(R^)  has  the  property  that  it  is  an  algebra  under  the 
operation  of  convolution  with  respect  to  the  complex  field.  In  other 
words,  if  the  functions  x(t)  and  y(t)  are  elements  of  L^R1)  then  the 
function  given  by 


00 

(x*y)(t)  *  /  x(t)  y(t-x)  dx 

—  00 


(2.2-15) 


is  in  l.l(Rl)  and  a(x*y)  =  (ax)*y  =  x*(ay)  for  all  a  R^  .  Showing 
x*y  L^R1)  is  easy  by  considering  the  integral  of  its  magnitude,  i.e., 


00 

/  | x*y |  dt 

—  00 


I 


00 

/  x(x)  y(t-x)  dx 
—  00 


dt 


<  II  |x(x)  y(t— x) |  dxdt. 


(2.2-16) 
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material  for  system  and  signal  analysis  in  electrical  engineering.  It  is 
left  to  the  reader  to  find  the  proofs  in  a  text  on  Fourier  analysis 
since  there  exists  a  number  of  good  books  on  the  subject.  Some  of  them 
are  listed  in  the  ref erences. 

The  Fourier  transform  of  a  function  is  defined  to  be 


00 

X(f)  *  /  x(t)  e  -*27T^C  dt. 


(2.2-9) 


The  Fourier  transform  is  known  to  exist  for  all  functions  in  L^(R^)  and 
by  the  following  theorem,  X(f)  is  shown  to  be  an  element  of  the  normed 
linear  space  C®(R^)«  C^(R^)  is  the  space  of  continuous  bounded  complex 


functions  over  R*  that  has  as  its  norm 
IIxUq  =  sup  |  x(t )  j . 


(2.2-10) 


Theorem  2. 1  Let  x(t)  £  L^R1).  Then  the  Fourier  transform  exists  and  is 
given  by 


X(f)  -  /  x(t)  e~j2irft 


(2.2-9) 


and  X(f)  is  an  element  of  C^(r1). 


The  inverse  Fourier  transform  of  X(f)  is  defined  to  be 


£(t)  -  /  X(f)  ej2lTft  df 


(2.2-11) 


and  by  the  following  two  theorems  is  known  to  exist  for  all  X(f)  that  are 
transforms  of  elements  of  L*(R*). 

Theorem  2.2  Let  x(t)  £  L^(R^).  Also,  let  9  be  a  positive  even  function 
on  Rl  and  monotonically  decreasing  on  (0,®)  where 


xQ  in  X  in  the  sense  Chat  Hxn  -  XqH  *  0  as  n  -*■  ®. 

Finally,  L^(r1)  has  the  additional  property  of  being  a  Hilbert  space 
which  is  a  Banach  space  that  has  defined  on  it  an  inner  product  between 
two  elements. 1*2  An  inner  product  is  an  operation  that  associates  a 
complex  number  to  each  pair  of  elements,  x  and  y,  in  the  space  and  is 
denoted  by  <f,g>.  It  has  the  following  properties  for  all  elements  x,  y, 
and  z  in  a  Hilbert  space  and  a  £  r1; 

i)  <x,y>  *  <y,x>*,  where  *  denotes  complex  conjugation 

ii)  <x+y,y>  »  <x,y>  +  <z,y> 

iii)  <ax,y>  =  a  <x,y> 

iv)  <x,x>  >  0  with  equality  holding  if  and  only  if  x  =  0. 

In  the  space  L^(Rl)  the  inner  product  can  be  defined  as 

CO 

<x,y>  =  /  x(t)  y*(t)  dt.  (2.2-7) 

— oo 

It  is  easy  to  verify  that  this  definition  does  obey  the  four  properties 
of  the  inner  product  listed  above. 

Lastly,  in  a  Hilbert  space  It  can  be  shown  that  /<x,x>  defines  a 
suitable  norm  in  that  it  obeys  the  four  properties  of  the  norm  listed 
earlier.  In  the  case  of  L^(r1)  the  norm  is  given  by 


U  xii  2 


A 


/<x,x> 


00 

1/2 

00 

= 

/  x(t)  x*(t)  dt 

3 

J  | x ( t ) |  dt 

—  00 

—  00 

1/2 

» 


(2.2-8) 


which  coincides  with  (2.2-6). 


2.2.2  The  Fourier  Transform  and  Convolution 
Presented  in  this  section  are  theorems  concerning  the  Fourier 
transform  and  convolution  of  functions  that  are  elements  of  the  spaces 
L*(r1)  and  l2(r1).  Most  of  these  theorems  are  part  of  the  foundational 
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Both  l1(r!)  and  L^(R^)  are  normed  linear  spaces.  In  general,  a 
normed  linear  space  X  is  a  space  on  which  exists  a  function  which  maps 
elements  of  X  to  R^-.  This  function  is  called  the  norm,  is  denoted  by  llxll 
where  x  £  X,  and  satisfies  the  following  axioms  when  x,  y  £  X  and  a  £  R* : 

i)  Hxll  >  0  with  equality  holding  if  and  only  if  x  =  9. 

ii)  II x+y II  <  llxll  +  Ily II 

iii)  II axil  =  |  a|  llxll. 

Property  (ii)  is  often  called  the  triangle  inequality  and  is  used  to 
prove  the  inequality 

ll x-y II  >  llxll  -  #yll.  (2.2-4) 

For  the  spaces  L^(R^)  and  L,2(r1)  the  norms  are  defined  to  be 
00 

U  x  ll  x  =  /  |  x  ( t )  |  dt  for  x(t)  £  Ll(Rl)  (2.2-5) 


It  can  be  verified  that  these  definitions  do  satisfy  the  three  properties 
of  the  norm  listed  above. 

The  norm  is  sometimes  interpreted  to  be  a  measure  of  an  element's 
distance  from  the  zero  element.  More  generally,  if  x  and  y  are  two 
elements  in  a  normed  space,  lix-yll  is  a  measure  of  the  distance  between 
them.  From  property  (i)  of  the  norm  it  is  seen  that  llx-yll  »  0  if  and 
only  if  x  =  y. 

Both  L,1(rI)  and  L^(R^)  are  Banach  spaces  because  they  are 

complete. A  Banach  (complete)  space  is  a  space  in  which  every  Cauchy 

00 

sequence  has  a  limit.  Thus  if  {x  }  ^  is  a  sequence  in  a  Banach  space  X 

and  Jxn-xmll  ♦  0  as  n  ♦  ®,  then  the  sequence  converges  to  some  element 
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In  either  L.'-(rI)  or  L,2(R.l),  two  elements  (functions),  f  and  g,  are 
considered  to  be  identifiable  if  they  differ  from  one  another  on  a  set  of 
measure  0.  In  this  case,  it  is  customary  to  write  f  i  g  or  f  =*  g  a.e. 
which  means  f  is  equal  to  g  'almost  everywhere.'  For  example,  consider 
the  function 

‘  2  ,  t  £  Z 

f(t)  --  (2.2-3) 

_  exp ( - 1 1  j  )  ,  t  £  RVz. 

Here,  the  function  f(t)  maps  R*  to  C  through  a  symetric  decaying 
exponential  except  when  t  is  an  integer.  Since  the  set  of  integers  Z  is 
a  countable  set,  it  has  measure  0,  so  f  =  exp(-|»|)  or  f  *  exp  (-|  *|) 

3  •  0  • 

Both  L^R1)  and  L2(Rl)  are  linear  spaces.  In  general,  a  linear 

space  X  is  a  set  of  elements  that  is  closed  under  the  operations  of 
addition  and  multiplication  with  respect  to  elements  in  a  scalar  field. 
For  example,  if  x,  y  £  X,  then  (x  +  y)  €  X,  and  if  a  is  a  scalar  then 
ax  £  X.  Additionally,  the  linear  space  X  satisfies  the  following  axioms 
when  x,  y,  z  £  X  and  a,  3  £  R^: 

i)  Commutivity:  x  +  y  =*  y  +  x 

ii)  Associativity  under  addition  (x  +  y)  +  z  ■  x  +  (y  +  z) 

iii)  The  existence  of  a  zero  element  9  such  that  x  +  9  *  x 

iv)  Associativity  under  multiplication:  a(Sx)  =  (aB)x 

v)  Distribution:  a(x  +  y)  =  ax  +  By 

vi)  The  existence  of  the  scalar  0  and  1  such  that  Ox  =  9  and 
lx  =  x. 

It  should  be  noted  that  subtraction  is  done  using  the  scalar  a  =»  -1, 
which  in  the  scalar  field  is  the  additive  inverse  of  1.  The  element  -lx 
is  denoted  -x  and  so  x  -  x  ■  x  +  (-lx)  *  (1-1)  x  ■  Ox  a  9. 


also  show  under  what  condition  a  linear  time-varying  system  produces  a 
stable  output.  Stating  this  statiblity  criterion  allows  additional 
properties  of  linear  time  varying  system  functions  to  be  discussed  which 
were  not  developed  in  Ziomek's  original  work. 

Readers  with  a  background  in  mathematics,  electrical  engineering,  or 
acoustics  will  probably  find  that  sections  2.2  and  2.3  contain  no 
material  unfamiliar  to  them.  Those  already  knowledgeable  in  the 
mathematical  theory  of  doubly-spread  communication  channels  (in  the 
context  of  scattering  functions)  can  proceed  to  Chapter  3. 

2.2  Mathematical  Background 

2.2.1  The  Spaces  Ll(Rl)  and  L2(Rl) 

In  the  mathematical  development  of  the  theory  presented  in  this 
thesis  all  functions  used  to  represent  signals  will  be  elements  of  the 
spaces  1.1  (R1 )  and  L2(r!).  The  space  Ll(Rl)  is  defined  to  be  the  class  of 
all  Lebesque  measurable  functions  that  map  the  field  of  real  numbers  R1 
(the  real  line),  to  the  field  of  complex  numbers  C  (the  complex  plane), 
and  are  magnitude  integrable.  A  more  compact  notation  for  this  class  of 
functions  is 

CO 

Ll(Rl)  =  {f:  Rl  ♦  C  |  /  | f (t) |  dt  <  »}.  (2.2-1) 


Similarly,  the  space  L2(Rl)  is  defined  to  be  the  class  of  all  Lebesque 
measurable  complex  functions  that  map  R*  to  C  and  are  square  integrable. 
Again,  a  more  compact  notation  for  this  class  of  functions  is 
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CHAPTER  2 

TIME-VARYING  SYSTEM  THEORY 

2. 1  Introduction 

The  primary  objective  of  this  chapter  is  to  introduce  the 
mathematical  fundamentals  that  will  be  used  throughout  this  thesis. 
Section  2.2.1  presents  a  brief  review  of  the  properties  of  linear  spaces. 
In  particular,  the  spaces  L^(R^)  and  L,2(r1)  will  be  discussed  since  their 
elements  are  functions  that  will  be  used  to  model  signals.  In  section 
2.2.2  the  Fourier  transform  is  introduced  and  several  theorems  are  stated 
concerning  the  Fourier  transforms  of  functions  in  L^(R^)  and  l2(r1). 
Section  2.3  presents  a  review  of  linear  system  theory  for  both 
time-varying  and  time-invariant  systems.  The  chapter  first  presents  the 
output  of  a  deterministic  linear  time-invariant  system  as  a  convolution 
integral  between  the  input  waveform  and  the  system  impulse  response 
function.  This  is  then  extended  to  find  the  output  of  a  deterministic 
linear  time-varying  system  which  is  described  by  a  time-varying  impulse 
response  function.  Finally,  Section  2.4  describes  four  autocorrelation 
functions  for  time-varying  systems  for  both  deterministic  and  random 
cases.  For  random  linear  time-varying  systems  it  is  shown  that  the 
autocorrelation  functions  lead  to  the  definition  of  the  scattering 
function  provided  the  system  is  used  to  model  a  wide  sense  stationary 
uncorrelated  channel  (WSSUS). 

It  should  be  mentioned  that  the  derivation  of  the  scattering 
function  as  developed  from  random  linear  time-varying  system  theory  was 
originally  done  by  Ziomek^  in  his  Ph.D.  dissertation.  What  has  been 
done  in  this  chapter  is  to  not  only  present  the  same  derivation  but  to 


Finally,  Chapter  4  introduces  an  iteration  algorithm  to  optimize  the 
expected  matched  filter  receiver  response  to  a  signal  transmitted  through 
a  channel  having  a  known  scattering  function.  The  algorithm  is  derived 
using  the  calculus  of  variations  and  generates  a  series  of  processing 
signals  which  in  turn  produces  a  convergent  monotonically  increasing 
series  of  expected  matched  filter  outputs.  The  chapter  closes  with 
several  numerical  examples  of  the  optimization  technique. 
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known.  The  optimization  is  accomplished  by  iteratively  altering  the 
processing  signal  of  the  matched  filter  given  a  fixed  transmit  signal  and 
channel  scattering  function.  Since  the  performance  of  a  receiver  is 
determined  by  the  signal  to  noise  ratio  (SNR)  of  its  output  (in  this  case 
defined  as  the  ratio  of  the  receiver  signal  response  to  white  Gaussian 
noise  and  clutter  or  reverberation  response),  increasing  the  receiver 
response  to  the  channel  output  without  significantly  increasing  the 
response  to  the  interference  will  increase  the  SNR  and  imply  an  improved 
ability  of  the  receiver  to  detect  the  channel  output  signal.  If  the  only 
interfering  signal  is  white  Gaussian  noise,  then  increasing  the  receiver 
signal  response  guarantees  an  increase  in  SNR. 

It  is  important,  however,  to  present  the  fundamental  theory  used  to 
establish  the  properties  of  the  communication  channel  and  performance  of 
the  receiver.  Therefore,  Chapter  2  opens  with  a  review  for  the 
properties  of  linear  spaces  and  Fourier  analysis.  The  remainder  of  the 
chapter  presents  the  development  of  linear  time-varying  system  theory  for 
both  the  deterministic  and  stochastic  cases.  This  eventually  leads  to 
the  derivation  of  the  scattering  function  which  can  be  used  to  describe 
the  time  delay  and  frequency  spreading  characteristics  of  the 
communication  channel. 

Chapter  3  introduces  the  theory  of  matched  filter  receivers  and 
shows  their  use  in  hypothesis  testing  for  signal  detection.  The 
relationship  between  scattering  function  and  receiver  output  is 
developed.  This  is  used  in  the  closing  section  of  the  chapter  to  show 
the  conditions  necessary  for  the  matched  filter  to  optimally  detect  a 
signal  that  has  been  transmitted  through  a  channel  whose  scattering 


function  is  known. 
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y i ( C )  =  h( )  xCt-Ti)  At  .  (2.3-2) 

It  follows  that  as  At  becomes  smaller  the  approximation  of  y^(t)  by 
(2.3-2)  becomes  more  accurate.  Thus  in  the  limit 

y. (t)  =  lim  h( t . )  x(t— t  )  At  .  (2.3-3) 

1  At+0 

The  desired  system  response,  however,  is  the  sura  of  all  the  responses  to 
the  approximating  pulse  train,  so 

y(t)  *  l  y. (t)  =  l  h( t, )  x(t— t, )  At  .  (2.3-4) 

i  i 

Again,  as  At  becomes  small  it  follows  that  the  approximation  of  y(t)  by 
(2.3-4)  becomes  more  accurate.  Furthermore,  the  summation  becomes  an 
integral  leading  to  (2.3-1). 

It  should  be  noted  that  from  (2.3-1)  it  immediately  follows  that 
h(t)  is  the  system  response  to  an  impulse  of  unity  weight,  i.e., 

00 

y(t )  =  /  h(t)  5(t-t)  dt  =  h(t).  (2.3-5) 

—  oo 

As  mentioned  earlier,  (2.3-1)  describes  a  system  whose  behavior  is 
invariant  with  respect  to  time.  A  clearer  statement  of  this  can  be  made 
if  one  considers  the  response  of  the  system  to  an  impulse  applied  at  some 
time  t0  *  0  (x(t)  =  5(t-t0)).  Using  (2.3-1),  the  system  response  is 
00 

y(t)  =  /  h( t)  S(t-t0-T)  dt  =  h(t-tQ) ,  (2.3-6) 

—  OO 

which  is  just  the  system  impulse  response  shifted  forward  in  time  an 
amount  t0.  Outside  of  the  fact  that  the  response  exhibits  a  time  shift, 
it  has  the  same  form  as  the  function  h(t).  Therefore,  the  properties  of 
the  system  remain  the  same  regardless  of  when  the  input  is  applied. 


In  the  real  world,  however,  communication  channels  are  never  blessed 
with  the  property  of  time-invariance.  Their  responses  change  over  a 
period  of  time  depending  on  what  mechanisms  carry  the  signals  in  their 
respective  mediums. 

To  account  for  the  time-varying  property  of  a  linear  system,  the 
convolution  integral  determining  the  response  can  be  rewritten  as 

CO 

y(t)  =  /  h( t, t )  x(t-x)  dt  ,  (2.3-7) 

—  00 

where  h(x,t)  describes  the  system  response  at  time  t  to  an  Impulse  of 
unity  weight  applied  at  time  t  -  t.  In  this  case  t  can  be  thought  of  as 
the  variable  that  describes  the  'antiquity'  or  'age'  of  the  impulse 
input. 9  By  defining  the  impulse  response  in  this  way,  it  will  be  shown 
later  that  a  symmetry  will  exist  between  h(x,t)  and  its  Fourier  transform 
with  respect  to  x. 

One  restriction  that  will  be  placed  on  the  system  described  by 
h(x,t)  is  that  every  bounded  input  will  produce  a  bounded  response.  Such 
systems  are  referred  to  as  'bounded  input,  bounded  output'  systems 
(BIBO).  The  conditions  for  a  system  to  be  a  BIBO  system  are  stated  in 
terms  of  the  impulse  response  and  are  given  in  the  following  theorem. 

Theorem  2.9  Let  a  time-varying  system  be  described  by  the  impulse 
response  h(x,t).  Then  the  system  is  a  BI30  system  if  and  only  if 
00 

/  I h(T»t) I  dx  <  »  (2.3-8) 

—  00 

for  all  t  £  R1. 

Proof .  Proving  the  theorem  in  the  reverse  direction  is  easy.  Suppose 
the  system  input  x(t)  is  bounded,  i.e., 


r  r 


|x(t)|  <  M  for  C 


then  for  any  t 


y(t)|  =  /  h( x, t )  x(t-i)  dt 


(2.3.9) 


<  /  | h( x, t ) |  |x(t-x)|  dx 


<  M  /  |h(x,t)|  dx  <  ®  . 


(2.3-10) 


Hence,  any  bounded  input  produces  a  bounded  output. 

Proving  the  theorem  in  the  forward  direction  is  done  by 
contradiction.  Suppose,  a  system  is  a  BIBO  system  but  its  impulse 
response  does  not  satisfy  (2.3-8).  This  can  be  stated  as 


sup  /  jh(x,t) |  dx  =  «  . 
t  — °® 


(2.3-11) 


Therefore,  for  any  M  >  0,  there  exists  a  tji  such  that 


/  ! h C t , tM ) |  dx  >  M 


(2.3-12) 


Now  define  the  bounded  input  function 
xM(t)  =  sgn[h(tM-t,tM)j 


(2.3-13) 


which  is  equal  to  the  sign  of  the  system  response  at  time  t^j  to  an 
impulse  of  unity  weight  applied  to  the  system  input  at  time  t.  The 
response  to  x^(t)  at  time  tj^  is 


y(tM)  -  I  h(x,tM)  sgn(h(tM-tM+x,tM)]  dx 

— oo  ‘  1 

QO 

=  /  lh(T,t>I)|  dx  >  M 


(2.3-14) 


Therefore,  given  any  M  >  0,  however  large,  we  can  construct  a  bounded 


input  which  will  produce  an  output  that  at  some  time  is  larger  than  M. 
This  contradicts  the  hypothesis  that  the  system  is  a  BIBO  system,  and  so 


the  theorem  is  proved. 

For  linear  time-invariant  systems  h(t,t)  reduces  to  h(t)  and  by 
(2.3-8) 

ao 

Uhl!  L  -  /  | h(  t)  |  d t  <  00  .  (2.3-15) 

—  CO 

Thus,  h(x)  is  an  element  of  L^(R^)  which  proves  the  following  corollary: 

Corollary  2.10  Let  a  time-invariant  system  be  described  by  the  impulse 
response  h(t).  Then  the  system  is  a  BIBO  sytem  if  and  only  if  h(t)  is  an 
element  of  L^(R^). 


It  is  now  possible  to  solve  for  the  response  of  a  time-varying 
system  using  Fourier  analysis.  Since  the  input  x(t)  is  assumed  to  be 
an  element  of  L^(R^)  and  l2(r1),  its  Fourier  transform  exists  because 
the  transform  exists  for  all  functions  in  either  space.  Therefore,  using 
(2.2-11)  and  (2.3-7), 


y(t )  =  /  h(r,t)  x(t-r)  d x  =  /  h( t,t) 


/  X(f)  ej2ltf(t  T)  df 


d  t 


-  / 


/  h(  t,  t )  e  ^2lTfT  (jT 


X(f)  e 


j2uft 


df. 


(2.3-16) 


Notice  that  the  bracketed  term  in  the  integrand  of  (2.3-16)  is  of  the 
same  form  as  a  Foutier  transform  of  a  function  of  a  single  variable.  It 
is,  in  fact,  the  Fourier  transform  of  h(x,t)  with  respect  to  t  if  one  is 
allowed  to  define  the  transform  of  a  multivariable  function  with  respect 
to  a  single  variable.  Here,  the  bracketed  terra  will  be  defined  as 
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H(f,t)  =  /  h(T,t)  e  j-1TfT  dt 


(2.3-17) 


Furthermore,  H(f,t)  exists  and  is  bounded  because  by  (2.3-8) 


uu 

[  H(  f ,  t )  j  =  /  h(t,t)  e  "J2^fT  dT 


<  /  |  h(  t  ,  t )  |  dt  <  «  . 


(2.3-18) 


Now  (2.3-16)  can  be  rewritten  as 


OO  OP 

y(t )  =  /  h(i,t)  x(T-t)  dt  -  /  H(f , t )  X(f)  ej2*ft  df.  (2.3-19) 


It  appears  that  the  system  response  is  the  inverse  Fourier  transform 
of  the  function  H(f,t)  X(f).  This  is  not,  in  fact,  true,  because  the 
Fourier  transform  of  the  response  y(t)  regardless  of  whether  the  system 
is  time-varying  or  time-invariant  is 


Y(f)  -  /  y(t)  e'j2lTft 


(2.3-20) 


which  is  not  a  function,  of  both  f  and  t.  The  only  time  Y(f)  =  H(f,t) 
X(f)  is  when  the  system  is  time-invariant.  In  this  case,  h(t,t)  -*■  h(  t) 
which  means  H(f,t)  *  H(f),  so  (2.3-19)  reduces  to 


y(t)  -  /  h( t)  x(x-t)  dx  =  /  H(f)  X(f) 


J2irft 


df.  (2.3-21) 


This  also  follows  from  Theorem  2.4  which  states  that  the  convolution  of 
two  functions  is  equal  to  the  inverse  Fourier  transform  of  the  product  of 
the  Fourier  transforms  of  the  two  functions.  Because  y(t)  is  given  by 


y(t)  =  /  Y(f)  ej2m  df, 


(2.3-22) 


and  because  of  the  uniqueness  of  the  Fourier  transform,  Y(f)  *  H(f)  X(f). 


It  must  be  stressed,  however,  this  is  only  guaranteed  for  time-invariant 
systems. 

The  function  H(f,t)  can  be  interpreted  to  be  the  modulation  of  a 
single  frequency  (monochromatic)  signal  applied  to  the  system.  This 
can  be  seen  by  finding  the  response  to  the  signal  x(t)  *  eJ2nf0t.  From 
(2.3-7)  and  (2.3-17), 


y(t)  *  /  h(r,t)  e 


j27Tf  (t-T) 


dT 


/  h(T,t) 


-j2irf  t 


di 


=  H(f,t)  e 
o 


j2irfot 


(2.3-23) 


If  H(f0,t)  changes  with  respect  to  t  (time),  it  follows  that  the  amplitude 
of  the  input  signal  is  modulated.  Since  H(f,t)  can  be  used  directly  to 
find  the  response  to  a  monochromatic  signal,  it  is  often  referred  to  as 
the  time-varying  transfer  function  or  the  time-varying  spectrum  of  the 
system  impulse  response. 


2.3.2  The  Space  L^(R^)  and  the  Time-Varying  System  Functions 
It  was  shown  in  the  last  section  that  if  the  time-varying  system 
impulse  response  h(t,t)  has  the  property 


/  | h( t , t ) |  dx  <  «  for  all  t  £  Rl 


(2.3-8) 


then  it  describes  a  BIBO  system  and  the  time-varying  transfer  function  is 
defined  as 


H(f , t )  =  /  h(x,t)  e_:12lTfT  dx  . 


(2.3-17) 


A  question  that  naturally  arises  is:  Can  the  Fourier  transform  of  h(T,t) 
with  respect  to  t  be  found  and  what  is  its  interpretation?  Also,  is  it 
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possible  to  find  the  two-dimensional  Fourier  transform  of  h(r,t)? 

An  affirmative  answer  to  these  questions  requires  an  additional 
restriction  on  h(t,t). 

It  will  be  assumed  that  the  time-varying  impulse  response  h(t,t)  is 
an  element  of  the  normed  linear  space  L*(R2).  The  space  L^(R^)  is  the 
class  of  all  Lebeque  measurable  functions  that  map  the  set  of  real 
ordered  pairs  R^  to  the  complex  field  C,  and  are  magnitude  integrable, 
i.e.  , 

00 

L^R2)  =  {x:  R2  C  |  //  j  x  (  x ,  t )  j  dxdt  <  •}  .  (2.3-24) 

—  00 

The  norm  of  any  element  x(t)  in  L^(R2)  is  defined  as 

OO 

II x II ^  ■  //  | x( T, t ) |  dtdt  .  (2.3-25) 

—  00 

The  two-dimensional  Fourier  transform  of  any  element  x(t)  in  L^(R2)  is 
given  by 

X(f,$)  =  //  x(x,t)  e~j2ir(fT+^t)  dTdt  (2.3-26) 


and  is  bounded  because 


X(f,*)| 


//  X(t,t)  a-j2’<£T+W)  dTdt 


<  /  J  j  X  (  T  ,  t )  |  dxdt  =  II X  II < 


(2.3-27) 


An  interesting  and  useful  property  of  the  space  L^(R2)  is  that  if 
h(x,t)  is  an  element  of  L^(R^)  then 


/  | h( r , t ) |  dx  <  «  and  /  |h(x,t)|  dt  <  »  . 


(2.3-28) 


This  property  arises  from  Fubini's  reduction  theorem  for  double 
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integrals  and  it  should  be  noted  that  its  converse  is  not  true.  A 
consequence  of  this  property  is  that  it  is  possible  to  find  the  one¬ 
dimensional  Fourier  transform  of  h(x,t)  with  respect  to  either  t  or  t. 

The  Fourier  transform  of  h(x,t)  with  respect  to  t  exists,  is 
given  by 

S(x,<f>)  =  /  h(i,t)  e“j2nt  dt  (2.3-29) 


and  is  bounded  for  all  t  and  4>  because  by  (2.3-8)  it  is  easily  seen  that 


:S(x,*)|  =* 


/  h(t,t)  e-^2*^  dt 


<  /  | h( t, t ) |  dt  <  «  . 


(2.3-30) 


The  function  S(x,$)  is  referred  to  as  the  spreading  function^  and  can  be 
interpreted  as  a  measure  of  the  system  time  variation.  It  follows  that 
if  S(  t ,  <J> )  is  significantly  large  over  a  wide  range  of  <J>,  then  the  system 
is  rapidly  time  varying.  Otherwise,  if  S(x,4>)  is  significant  only  over  a 
small  range  of  centered  around  4>  =  0,  then  the  system  changes  slowly 
with  time. 

Another  description  of  a  time-varying  system  can  be  given  by  finding 
the  Fourier  transform  of  the  time-varying  transfer  function  H(f,t)  with 
respect  to  t ,  i. e.  , 


B(f ,>) 


S  /'  H(f , t )  dc  . 


(2.3-31) 


This  is  also  equivalent  to  finding  the  two-dimensional  Fourier  transform 
of  h(x,t),  because  if  H(f,t)  is  replaced  by  (2.3-17)  then 

00 

B(f , $)  =  H  h(x,t)  e-j27,(fT+^)  dTdt  .  (2.3-32) 


—  CO 
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Furthermore,  B(f,<j>)  is  bounded  because 


|B(f,4>)| 


j]  h(T.t)  drdc 


<  //  | h ( T,t)|  dtdt  <  »  .  (2.3-33) 

—  oo 

The  function  B(f,4>)  is  referred  to  as  the  bi-frequency  function*®  and  can 
be  interpreted  as  a  measure  of  the  amount  of  modulation  the  system  output 
exhibits  when  the  input  is  a  monochromatic  signal.  For  example,  in 
section  2.3.1  it  was  shown  that  if  the  system  input  is  eJ2^f0t  then  the 
output  is 


j  2irf  t 

y (t )  *  H(fQ,t)  e  °  .  (2.3-34) 

Therefore,  H(f0t),  the  system  transfer  function  evaluated  at  fQ,  is  the 
modulation  function  of  the  output  signal.  The  Fourier  transform  of  the 
modulation  signal  with  respect  to  t  is  then 


/  H(f  ,t)  dt  =  B(f  ,<(>)  (2.3-33) 

o  o 


which  is  just  the  bi-frequency  function  evaluated  at  fQ  with  left  as  a 
free  parameter.  At  this  point  the  interpretation  of  the  bi-frequency 
function  is  clear.  If  the  modulation  of  output  signal  changes  rapidly 
with  time,  then  B(f0,$)  is  significant  over  a  wide  range  of  <}>.  On  the 
other  hand,  if  the  modulation  changes  slowly  with  time  as  a  result  of  the 
system  varying  slowly  with  time,  then  B(f0,ij>)  is  significant  for  a  small 
range  of  <j>  centered  about  =  0. 

The  bi-frequency  function  can  also  be  found  by  determining  the 
one-dimensional  Fourier  transform  of  S(t,4>)  with  respect  to  t.  This  can 
be  seen  by  finding  the  Fourier  transform  of  (2.3-29) 
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/  S( t , <j>)  e_:i2lTfT  dt 

—CO 


/ 


/  h(  t, t )  e  ^2tt^c  dt 
—  00 

It  h(t,t)  e-J2”<f,^c)  dtdt 


-j  2nf  t 
e  J  dx 


(2.3-36) 


which  is  the  same  as  (2.3-32). 

Figure  3  shows  the  relationship  among  the  four  time-varying  system 
functions.  Each  function  can  be  used  to  characterize  the  system  because 
if  any  one  of  the  functions  is  known  then  it  is  only  a  matter  of  Fourier 
transforming  (or  inverse  Fourier  transforming)  with  respect  to  the 
correct  variables  to  find  any  of  the  remaining  three  system  functions. 

The  bi-frequency  function  can  be  used  to  find  the  Fourier  transform 
of  the  output  signal  y(t).  From  (2.3-19),  (2.3-31),  and  the  definition 
of  the  Fourier  transform, 


Y(n)  =*  /  y(t)  e  j2:Tnc  cjt 
—  00 

-  //  X(f)  H( f , t )  e-2 2iI( n-f )t  dfdt 


/  X(f) 


f  H(f.t)  e’j27r(n-f)t  dt 


df 


=  /  X(f)  B(f , n-f)  df 


(2.3-37) 


Thus  Y(n)  can  be  found  by  convolving  X(f),  the  Fourier  transform  of  the 
input  x(t),  with  the  function  B(f,n-f).  The  function  B(f,n-f)  is  the 
projection  of  B(f,j>)  on  the  line  $  =  n-f  to  the  line  =  0.  This  is 
shown  in  Figure  4. 
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I 
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2.4  Random  Time-Varying  System  Theory 

2.4.1  System  Autocorrelation  Functions 

To  this  point  the  time-varying  system  described  by  the  time-varying 
impulse  response  h(x,t)  has  been  deterministic.  In  other  words,  h(i,t) 
is  known  precisely  for  all  T  and  t.  For  real  world  systems  this  is  never 
the  case,  so  the  behavior  of  h(x,t)  and  the  three  other  time-varying 
system  functions  must  be  described  by  their  statistical  properties.  This 
is  done  by  considering  the  autocorrelation  functions  of  all  the 


time-varying  system  functions.  They  are 

Rh(x,x',c,t')  =  E (h( x, t )  h* ( x ' , t 1 ) }  (2.4-1) 

RH(f,f\t,t')  =  E{H(f,t)  H*  ( f  '  >  t ' )  }  (2.4-2) 

Rs(t,T,J,V)  =  E{S(x,«>)  S*(x',1>')}  (2.4-3) 

RB(f,f’,<J>,4>’)  -  E{B(f,4>)  B*(f\<t>’)}  (2.4-4) 


Ziomek^  has  shown  that  with  the  correct  interpretation  the  four 
time-varying  filter  functions  are  related  through  two-dimensional  Fourier 
transforms.  For  example,  consider  the  spreading  function  autocorrelation 
function  which  by  (2.3-29)  and  (2.4-3)  is 


Rs(x,x\>,?)  =  E{  /  h(x,t)  e'j27T?t  dt  /  h*(x’,t')  ej2lTrt'  dt  ’  } 


=  E{//  h(x,t)  h*(x',t')  e"j27T(^"1>'t')  dtdt’ }  (2.4-3) 


Because  the  expectation  operator  and  integration  are  linear  operations, 
(2.4-5)  can  be  rewritten  as 


Rs(x,x’,j,V)  =  ;/  E{h(x,t)  h*(  x’  ,t ' ) }  e  j2l,(^  dtdt’ 


=  //  Rh(x,x',t,t') 


-j2^(  d>t  —  '  t ' ) 


dtdt ' . 


(2.4-6) 
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Note  chat  (2.4-6)  does  not  express  Rg(  f,  t 4> * )  as  a  two-dimensional 
Fourier  transform  of  R^( x , x ' , t , t ' ) ,  even  though  it  has  a  similar  form. 
This  is  because  the  two-dimensional  Fourier  transform  of  R^( x, x' , t , t ' ) 
is  actually  given  by 


ao 

//  VT,T 

—  <x> 


,t,t') 


e  —  j  2  TT  (  i)t  +  ij)'t  '  ) 


dtdt ' . 


(2.4-7) 


The  difference  between  (2.4-6)  and  (2.4-7)  can  be  seen  in  the  argument  of 
the  exponential  function  in  the  integrand. 

The  discrepancy  can  be  cleared  up  redefining  the  Fourier  transform 
with  respect  to  x '  and  t'  as 


CO 

/  h(x ' 

— ao 

'.to  eJ2"£,t' d,’ , 

(2.4-8) 

oo 

/  h(  x ' 

—  CO 

,f)  eJ2’,4,'t'  dC  . 

(2.4-9) 

Thus,  by  using  (2.4-9),  (2.4-6)  is  the  two-dimensional  Fourier  transform 
of  Rh( x , t ' , t , t ' )  which  can  be  written  as 

Rh(x,x'  ,t,t')  <=>  Rs(x,x*. ♦,$•).  (2.4-10) 

t.t* 

Similarly,  using  (2.4-8)  and  (2.4-9)  the  relationships  between  all  the 
system  autocorrelation  functions  are 


Rh(x,x',t,t')  <=>  RH(f  ,f  '  ,t,t’ ) 
T  ,  X  ' 


Rh(x,x'  ,t,t ' )  <==>  RB(f  ,f  '  ,$,<?' ) 
x.x'.t.t’ 


(2.4-11) 

(2.4-12) 


ls(x,x',l>,6')  <=>  Rg(f  ,f’,*,$') 


(2.4-13) 


x ,  x ' 


RH(f,f’,t,t') 


<=> 

t,f 


RB(f ,f ' 


■t>,  V). 


(2.4-14) 


Figure  5  illustrates  the  relationships  among  all  four  autocorrelation 
f  unctions . 
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where  C?  is  the  radial  speed  referenced  away  from  the  scatterer  and  C  is 
propagation  speed  of  the  medium.  It  can  be  shown  that  the  Fourier 
transform  of  (3.2-11)  is  given  by 

Y(f)  =  J7T  X(s}  ej2lTfT’  (3.2-14) 

The  effect  the  doppler  variable  has  on  the  spectrum  of  the  input 
signal  is  illustrated  in  Figure  8.  For  a  scatterer  traveling  away  from 
the  receiver,  the  doppler  variable  is  less  than  one,  causing  the  spectrum 
to  move  toward  lower  values  of  f.  This  occurs  because  the  point 
scatterer  is  traveling  with  the  expanding  wavefront  of  the  transmitted 
signal,  so  it  is  irradiated  by  the  wavefront  for  a  period  of  time  longer 
than  the  duration  of  the  signal.  This  causes  the  reflected  signal  to  be 
a  time  stretched  version  of  the  transmit  signal  changing  slower  in  time 
and  consequently  composed  of  exponential  signals  of  lower  frequency.  A 
similar  analysis  holds  when  the  point  scatterer  moves  toward  the 
receiver.  In  this  case  the  scatterer  is  irradiated  by  the  expanding 
transmitted  wavefront  for  a  period  of  time  less  than  the  transmit  signal 
time  duration.  Consequently,  the  signal  is  compressed  in  time,  changes 
more  rapidly  than  the  transmit  signal  and  so  is  composed  of  higher 
frequency  exponential  signals.  This  is  also  seen  from  Figure  8,  where  the 
doppler  variable  is  greater  than  one  causing  the  spectrum  to  shift  to 
higher  values  of  f. 

Another  phenomenon  induced  by  the  doppler  variable  s  is  a  change  in 
bandwidth  for  bandpass  transmit  signals.  If  the  spectrum  of  the  transmit 
signal  is  significant  only  in  the  interval  ( f i_ , f 2 )  then  the  spectrum  of 
the  received  signal  is  significant  in  the  interval  (sfi,sf2)«  This 
occurs  because  the  spectrum  of  the  transmit  signal  is  either  stretched  or 


v  ■  ■■  ■■■ 


V 


▼ 


-1. 
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constant  b  in  (3.2-9)  is  not  a  random  variable.  In  either  the  random  or 
deterministic  case  the  idea  is  the  same;  if  spreading  or  scattering 
occurs  at  only  one  x  and  then  the  system  output  is  a  scaled  time  and 
frequency  shifted  replica  of  the  input  signal.  The  time  shift  x  is 
evident  by  examining  the  argument  of  (3.2-5)  and  (3.2-9).  Finding  the 
Fourier  transform  of  (3.2-5)  or  (3.2-9)  gives  the  spectrum  of  the  channel 
output  signal  as 

Y(f)  =  b  X(f-$)  ej2Trft  (3.2-10) 

where  X(f)  is  the  spectrum  of  the  transmit  signal.  Examination  of  the 
right  side  of  (3.2-10)  shows  the  frequency  shifting  of  the  transmit 
signal  spectrum  by  the  amount  $. 

The  reflection  from  a  point  scatterer  that  has  a  given  range  and 
velocity  can,  under  certain  conditions,  be  modeled  by  a  random  linear 
time-varying  system  whose  output  is  a  scaled  version  of  its  input 
having  a  specific  time  shift  x  and  frequency  shift  $.  This  is  why  the 
scattering  function  described  by  (3.2-1)  was  considered.  A  more  accurate 
modeling  of  the  point  scatterer  reflection  can  be  found  by  considering 
Figure  7  where  the  geometry  of  a  monostatic  signal  detection  system  is 
shown.  Here  the  reflection  of  the  transmit  signal  is  given  by 

y(t)  =  b  x(s(t-x))  (3.2-11) 


where  x  is  the  range  delay,  s  is  the  doppler  variable,  and  b  is  a  random 
variable.  Applying  simple  physics  to  the  geometry  it  can  be  shown 
that  ^ 


l  -  cT/c 
i  +  cT/c 


(3.2-12) 

(3.2-13) 
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Ry  ( t ,  t ' )  =  E{  b  x(t-x)  e^2lT^C  b  x(t'-x)  e-’2*'^  *}.  (3.2-4) 

Thus  Che  system  output  can  be  interpreted  as 

y(t)  =  b  x( t— x )  ej27r4,t.  (3.2-5) 

Time-varying  systems  whose  outputs  are  of  the  form  of  (3.2-5)  are  used 
to  model  a  non-dispersive  communication  channel. 

A  similar  result  can  be  derived  by  considering  a  time-varying  system 
for  which  the  spreading  function  S (  t ,  )  is  known.  It  was  shown  in 
section  2.3.1  that  the  system  output  is  given  by 
00 

y(t )  =  /  h( x , t )  x( t — t )  dx.  (2.3-7) 

—  ao 

Also,  from  section  2.3.2  the  time-varying  impulse  response  is  related  to 
the  spreading  function  by 

00 

h(T,t)  =  /  S( t, <(>)  e-*21"**  d<|>.  (3.2-6) 

— oo 

Combining  (2.3-7)  and  (3.2-6)  gives  the  output  as 
0° 

y(t)  =  //  S( x, 4>)  x(t-x)  ej2lT4>t  d<j>dx.  (3.2-7) 

—  OO 

If  the  spreading  function  is  of  the  form 

S(x,<j>)  =  b  S(x-x)  5(<|>-$)  (3.2-8) 

then  the  system  output  is 
00 

y(t)  =  //  b  6(x-x)  6(tj>-<£)  x(t-x)  e^2^1"  d 4>d x 

— OO 

=  b  /  <5(<f>-$)  x(t-t)  ej2ir*C  d 4> 

—00 

-  b  x( t  —  x )  ej27r^C.  (3.2-9) 

Notice  that  (3.2-5)  and  (3.2-9)  are  of  the  same  form  except  that  the 


Finally,  in  section  3.3  it  is  shown  that  the  expected  matched  filter 
receiver  output  is  maximized  for  a  given  communication  channel  if  the 
cross-ambiguity  function  and  the  channel  scattering  function  are  in 
constant  proportion. 


3.2  Signal  Detection 

3.2.1  Propagation  Modeling 

Consider  a  linear  random  time-varying  system  which  models  a  WSSUS 
channel  and  has  a  scattering  function  of  the  form 


Rs(t,$)  «  K  S(t-t)  6(  4>-«>)  . 


(3.2-1) 


In  section  2.4.2,  the  scattering  function  was  interpreted  as  a  density 
function  which  determines  the  amount  of  delay  r  and  frequency  shift  4>  an 
input  signal  will  exhibit  at  the  output  of  the  system.  If  the  scattering 
function  has  the  same  form  as  (3.2-1),  then  it  implies  that  the  system 
output  will  be  a  replica  of  the  input  having  time  shift  T  and  frequency 
shift  *>.  This  can  also  be  implied  by  determining  the  system  output 
correlation  using  (2.4-33)  and  (3.2-1),  i.e., 


R  (tt,  t  * )  -  K  //  x(t-x)  6(  t-t)  S(4>-$)  x*(t'-x)  ej2ir<l>(At)  d  rd  d> 
y  —co 

=■  K  /  x(t-t)  <5($-$)  x*(t'-x)  gj^'KAt)  d 
—00 


-  K  Jl(t-T)  x*(f-?) 

.  K 

Defining  the  constant  K  as 


(3.2-2) 


K  =  E{ | b| 2)  =  E (b  b*} 


(3.2-3) 


then  (3.2-2)  can  be  rewritten  as 


CHAPTER  3 


MATCHED  FILTERING  IN  SIGNAL  DETECTION 


3. 1  Introduction 

In  this  chapter,  the  detection  of  a  signal  that  has  been  transmitted 
through  a  communication  channel  whose  scattering  properties  are  known  is 
developed.  Detection  will  be  performed  by  examining  the  output  of  a 
matched  filter  receiver.  In  this  receiver  structure,  the  channel  output 
signal  is  multiplied  by  a  processing  signal,  the  product  integrated,  and 
the  square  of  the  magnitude  of  the  integrator  output  is  used  as  the 
receiver  output.  If  the  receiver  output  exceeds  a  predetermined 
threshold,  then  it  is  assumed  that  a  portion  of  the  channel  output 
contains  a  transmitted  signal;  otherwise,  if  the  threshold  is  not 
exceeded  then  it  is  assumed  that  no  signal  is  present.  This  is  also 
referred  to  as  a  likelihood  ratio  test.  It  will  be  shown  that  the 
expected  value  of  the  matched  filter  response  to  a  channel  output 
containing  a  response  to  a  transmitted  signal  can  be  written  in  terms  of 
the  channel  scattering  function. 

In  section  3.2.1,  the  modeling  of  a  signal  transmitted  through  a 
non-dispersive  channel  (point  scatterer)  as  a  time  and  frequency  shifted 
version  of  the  original  transmit  signal  is  presented,  and  the  conditions 
necessary  for  this  modeling  to  be  valid  are  also  discussed.  Next, 
hypothesis  testing,  the  matched  filter  receiver,  and  the  derivation  of 
its  expected  output  are  presented  in  section  3.2.2,  followed  by  the 
derivation  of  the  expected  energy  of  the  channel  output  is  section  3.2.3. 
In  section  3.2.4  the  finite  volume  property  of  the  cross-ambiguity 
function  is  derived,  and  it  is  also  shown  that  the  function  is  bounded. 
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R  =  E  { /  X(f)  H(f,t)  ej27Tft  df  /  X*(f')  H*(f',t')  e"j2l,f'c'  df  ’  > 

^  —00  —00 
oo 

=  E  {//  X(f)  X*(f')  H(f,t)  H*  (  f  ' ,  t 1 )  ej2ir(ft-f  '*■">  dfdf  '  } 

—00 

00 

=  //  X(f)  X*(f')  E{H(f,t)  H*(f  ' ,  t  *  )  }  e  j  2ir(f  c-f  ’  t  ’ )  d£df, 

—  00 
00 

=*  //  X(f)  X*(f')  RR(f  ,f  ’  ,t,C' )  eJ2ir(ft~ft'>  dfdf'.  (2.4-28) 
—  00 

Under  the  WSSUS  assumption  Rjj(f  ,f ' ,  t,  t ' )  =  Rn(Af,At)  where  Af  »  f  -  f ' 
and  At  =  t  -  t',  so  (2.4-28)  becomes 

00 

R  (t,t’)  =  //  X(f)  X*(f’)  RH(Af,At)  e32lT(ft-f  ,c')  dfdf’. 

y  — *  'l  ■ 

(2. 4. -29) 

From  (2.4-26),  Rn(Af,At)  can  be  replaced  by  its  Fourier  transform 
relationship  to  the  scattering  function  Rg (  t ,  <|> ) ,  so  (2.4-29)  can  be 
restated  as 

Ry  ( t ,  t '  )  = 

////  X(£)  Rs(t,*)  X*(f  )  e  j2.(*(At)-(M)t) 


(2.4-30) 


This  multiple  integral  can  be  rewritten  as  the  multiple  iterated 
integ*  al 


R  (t,f)  =  JJ  /  X(f)  ej2lTf(t‘T)  df  R$(  t,  (j>) 

J  —  oo  .oo 


X  /  X(f')  e  j  27Tf  ’  C  c  ’ -T)  dfl  e  j  2  tt  4>(  At)  dT<J^  (2.4-31) 


By  (2.2-11)  the  bracketed  terms  in  the  integrand  of  (2.3-31)  become 


/  X(f)  e  j2lTf(t"T)  df  =  x(t-t)  . 


(2.4-32) 
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uncorrelated  with  the  value  for  all  f  greater  than  f0  +  B  or  less  than 
f0  -  B. 

The  important  result  of  this  section  is  that  under  the  assumption  of 
uncorrelated  spreading,  the  scattering  function  can  be  defined  as  well  as 
other  system  functions  based  on  the  original  system  autocorrelation 
functions.  These  other  system  functions  described  by  (2.4-20),  (2.4-23), 
and  (2.4-25)  are  related  to  the  scattering  function  as  either  forward  or 
inverse  Fourier  transforms.  This  is  illustrated  in  Figure  6.  From  the 
figure  it  can  be  seen  that 

R^Af.At)  -  //  Rg( x, <|>)  ej2*($Ut)-(Af)T)  dTd+  (2.4-26) 

—  QO 

which  can  be  verified  by  using  (2.4-20)  and  (2.4-25). 


2.4.3  The  System  Output  Correlation  Function 


In  this  section  the  correlation  function  of  the  linear  time-varying 
system  output  under  the  WSSUS  assumption  is  derived.  It  will  be  useful 
in  the  development  presented  in  the  next  chapter  where  random 
time-varying  system  theory  is  applied  to  signal  detection. 

By  definition,  the  output  correlation  function  is ^ 


Ry(t.t’)  =  E{y(t)  y*(t ' ) } 


(2.4-27) 


Replacing  y(t)  by  (2.3-19)  the  correlation  function  in  (2.4-27)  becomes 
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R^f.f ’,<!>,$’)  -  RgUf,*)  6(  $-<p '  ) 


(2.4-22) 


where 


Rg<Af  ,<fr) 


RS(T,^>) 


rj2TT(Af)T  dt> 


(2.4-23) 


Thus,  Rg(  Af ,  <j>)  is  the  Fourier  transform  of  the  scattering  function 
Rg(T,<}>)  with  respect  to  t. 

Finally,  an  expression  for  Ry(f ,f * ,t,t')  under  the  WSSUS  assumption 
can  be  found.  From  (2.4-11)  and  (2.4-19) 


RH(f,f',t,t’)  =  //  Rh(T,T’,t,f)  e-j2»(fT-f'T')  dTdT, 

— oo 

-  //  Rh(T,At)  <S(  t-t  ' )  e“i2,T(f  T_f  '  T’)  dxdT. 


/  ^(T.At) 


r  <5(  t-t'  )  e  -32*(ft-f,x’)  dT. 


dT. 

(2.4-24) 


Evaluating  the  inner  integral  gives 

RH(f,f',t,t')  =  /  Rh(x,At)  e-J2lT^>T  dT 

•  OO 

-  RH(Af,At).  (2.4-25) 

Equation  (2.4-25)  is  referred  to  as  the  time-frequency  correlation 
function.^  If  Rg(Af,At)  is  significant  only  on  a  region  centered  at 
( Af , At )  =  (0,0),  then  it  indicates  that  little  statistical  correlation 
exists  in  the  systems  behavior  for  either  time  separation  or  frequency 
separation.  For  example,  if  Rd(Af,At)  =  0  for  all  | At |  >  T,  then  the 
implication  is  that  the  system  transfer  function  at  time  t  is  in  no  way 
dependent  upon  the  form  of  the  system  transfer  function  for  all  time 
greater  than  t  +  T  or  less  than  t  -  T.  Similarly,  if  Rd(Af,At)  =  0  for 
all  j  Af  j  >  B,  then  the  value  of  the  system  transfer  function  at  any  f0  is 


^(T.T'.t.f)  =  //  Rg(r,T'f  ♦  ,*•)  eJ  2ir(  <*>t— '  t ' )  d^y 

—  00 


/  RQ  (  t  ,  <)> )  /  «(♦-*•)  ej27T(  4>t~4>’ t ' )  dy  d<)  5(t_tI)> 


(2.4-18) 


The  inner  integral,  due  to  the  properties  of  the  impulse  function,  is 
equal  to  eJ  2ir$(t-t ' ) .  if  the  notation  t  -  t'  *  At  is  adopted,  then 
(2.4-18)  can  be  rewritten  as 


Rh( T »  T  ’  »t,t')  =  Rh(T,At)  <S(  t-t*  ) 


(2.4-19) 


where 


Rh(x,At)  =  J  Rg ( x , )  ej2lT*Ut:)  d*. 


(2.4-20) 


Notice  that  R^( t , At )  is  inverse  Fourier  transform  of  the  scattering 
function  Rs(t,4>)  with  respect  to  $. 

A  similar  result  occurs  for  the  autocorrelation  function 
Rb  (f»f'  From  (2.4-13)  and  (2.4-16)  it  follows  that 

CO 

RB(f.f,,$,*')  =  a  Rs(T,T',*,*f>  e-j2i:(tT-f dTdT. 

—  00 
00 

-  JJ  R0(T,d)  6(  t-t'  )  5(  4>-q>' )  e-j  27TCt  T-f  •  T- )  dTdT, 


ao  ao 

J  R„  (  T  ,  )  J  fi(T-T')  e-j2TT(f  T-f  '  T'  )  dT,  dT 


(2.4-21) 


Again,  due  to  the  properties  of  the  impulse  function,  the  inner  integral 
is  equal  to  e“J  2:,( r  '  ) T.  Adopting  the  notation  Af  =  f  -  f 1  ,  (2.9-21)  can 


be  rewritten  as 
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2.4.2  Uncorrelated  Spreading 

In  their  present  form,  the  four  autocorrelation  functions,  (2.4-1) 
to  (2.4-4),  are  of  little  utility  for  communication  channel  description. 
The  additional  assumption  required  is  that  the  spreading  function  is 
essentially  uncorrelated  with  itself  for  different  values  of  T  and  <p. 

This  is  equivalent  to  expressing  the  spreading  function  autocorrelation 
function  as 

Rg  (t,t*  =  Rs  ( T ,  4> )  <$(  t-t*  )  S(4>-4>') 

-  ms(t,^)  ms*(T',4>')  (2.4-15) 

where  mg(T,<(>)  =  E{S(f,<j>)}.  Assuming  that  mg(T,<j>)  =  0  then  (2.4-15) 
reduces  to 

Rs(t,t'  ,<j>,<j>')  =  Rs(t,4>)  <S( t-t'  )  SU-<j>')  (2.4-16) 

where 

Rs(t,4>)  =  E{|S(T,<t>)|2}  >  0  .  (2.4-17) 

The  positive  semidefinite  function  Rg(i,()))  is  referred  to  as  the 
scattering  function10 » 1 1 > ^  and,  according  to  Ziomek.10,  can  be  interpreted 
as  a  density  function  which  determines  the  amount  of  delay  t  and 
frequency  shift  <p  an  input  signal  will  exhibit  at  the  output  of  a  random 
linear  time-varying  system  with  uncorrelated  spreading. 

Communication  channels  that  can  be  modeled  using  scattering  functions  are 
commonly  referred  to  as  wide  sense  stationary  uncorrelated  scattering 
(WSSUS)  channels.10*11 

The  WSSUS  assumption  not  only  effects  the  form  of  Rg(  T,  t'  ,  ij>,  y )  but 
also  the  remaining  three  system  autocorrelation  functions.  Consider 
R^(  T»  T  ' »  c  •  c  ’ )  and  its  relationship  to  Rg(  t,  t  ' ,  f>,  f>' )  via  the  Fourier 
transform.  Using  (2.4-10)  and  (2.4-16)  it  follows  that 


compressed  due  to  the  doppler  variable  s.  Therefore,  it  follows  that  if 
the  signal  bandwidth  is  defined  as 


BWTRANS  f2“  fl  (3.2-15) 

then  the  bandwidth  of  the  received  signal  is 

BWREC  =  s  f2  ”  s  fl  =  s  ^f2~  fl^  =  S  BWTRANS*  (3.2-16) 

This  change  in  bandwidth  can  be  related  to  the  received  signal  carrier 
frequency  defined  to  be  the  spectral  centroid.  Since  all  signals  are 
considered  to  be  elements  of  L,1(rA)  and  L.2(r1)  their  spectrums  are 
magnitude  integrable  by  Theorem  2.6,  so  the  spectral  centroid  is  defined 
as 


f 

°TRANS 


00 

/  f|x(f)|  df 

—GO 


00 

/  |x(f)|  df 

—00 


(3.2-17) 


If  the  spectrum  of  the  received  signal  is  given  by  (3.2-14),  then  the 
carrier  frequency  becomes 


/  £  |i  X(|)|  df 


REC 


/  if  *{)|  n 


/  f|x(f)|  df 

S 


/  |X(f)j  df 


s  f 


(3.2-18) 


TRANS 

Combining  (3.2-16)  and  (3.2-18),  the  received  signal  bandwidth  becomes 
f 


BW 


REC 


REC  f 


BW 


TRANS 


(3.2-19) 


The  shifting  of  spec 
Doppler  variable  s. 
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thus  the  change  in  bandwidth  is 


ABW  "  BWR£C“  BWTRANS 


=  (s-1)  BW, 


TRANS 


f  -  f 


TRANS 


TRANS 


TRANS 


TRANS 


(3.2-20) 


TRANS 


where  <j>  =  f  -  f  is  the  doppler  frequency  shift. 

°REC  otraNS 

from  (3.2-20)  this  frequency  shift  is  also  given  by 


Furthermore, 


-  (s-1)  f 


(3.2-21) 


TRANS 


It  is  easily  seen  that  if  the  carrier  frequency  is  substantially 
larger  than  the  transmit  signal  bandwidth,  the  change  in  bandwidth  due  to 
the  doppler  shift  4>  is  small.  If  a  transmit  signal  in  a  communication 
system  is  a  bandpass  signal  and  possesses  a  bandwidth  much  smaller  than 
the  carrier  frequency,  then  for  an  adequately  small  range  of  doppler 
frequency  shift,  the  return  signal  spectrum  given  by  (3.2-14)  can  be 
approximated  by  (3.2-10),  i.e., 


jjj-  X(|)  ej2*fT  *  X(f-<t>)  ej27tfT. 


(3.2-22) 


The  term  'adequately  small  doppler  frequency  shift'  means  that  the 
doppler  variable  s  is  never  large  enough  to  cause  the  change  in  bandwidth 
to  be  a  significant  percentage  of  the  transmit  signal  bandwidth. 


< 
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3.2.2  The  Matched  Filter  Receiver 

From  this  point  forward  ali  system  input  signals  are  assumed  to  be 
bandpass  signals  with  bandwidths  sufficiently  small  to  allow  the 
communication  channel  to  be  accurately  modeled  as  a  random  linear 
time-varying  system.  The  communication  channel  outputs  will  be 
considered  to  be  the  sum  of  the  responses  to  two  random  linear 
time-varying  systems  and  a  noise  process.  The  first  system  models  the 
channel  scattering;  the  second  system  models  the  medium  scattering. 
Detecting  the  presence  of  a  signal  is  reduced  to  the  hypothesis  test: 


Hq  :  r(t)  -  +  (3.2-23) 

:  r(t)  »  yCHN(t)  +  y^^  +  (2.3-24) 


where  r(t)  is  the  channel  output,  yR£v(t)  is  t*ie  reverberation  or  clutter 
response,  ycHN^)  *-s  c^e  response  to  channel  scattering,  and  n(t)  is 
noise.  It  will  be  assumed  that  the  noise  term  is  white  and  uncorrelated 
with  both  ycHN(c)  and  yREV(c)*  Furthermore,  it  will  be  assumed  that 
yCHN(t)  and  yR£v(t)  are  uncorrelated.  The  individual  responses  due  to 
the  channel  scattering  and  reverberation  are  given  by 

CO 

yCHN(t)  =  hCHN^T,t^  X^t-T^  dt  »  (3.2-25) 

—CO 

00 

yREV^c)  =  /  hREV^T,C^  x(c_t)  dt  *  (3.2-26) 


The  hypothesis  test  will  be  implemented  by  testing  the  output  of  a 
matched  filter  given  by 


1*1 


2 


00 

/  r(t )  g*(t)  dt 
—  00 


2 


(3.2-27) 


where  g(t)  is  called  the  processing  waveform.  If  the  output  |  2.|  ^  exceeds 
a  threshold  y  then  hypothesis  is  assumed  and  a  signal  has  been 
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detected.  The  threshold  Y  is  determined  to  maximize  the  probability  of 
detection  for  given  probability  of  false  alarm  (Neyman-Pearson  test). 

The  performance  of  the  matched  filter  receiver  for  different  threshold 
settings  is  presented  in  Van-Trees  so  it  will  not  be  discussed  here. 
The  entire  channel  model  and  receiver  structure  is  shown  in  Figure  9. 

Since  the  communication  channel  is  stochastic,  the  expected  matched 
filter  output  is  used  to  describe  the  performance  of  the  receiver.  For 
the  response  to  channel  scattering,  the  expected  output  is 


Because  it  is  assumed  that  the  noise  is  white,  its  autocorrelation 
function  is 

Rn(t.t')  -  N0  <5(t-t '  ) ,  (3.2-30) 


so  the  expected  value  of  the  receiver  output  due  to  the  noise  term  is 

E{|*J2}  *  //  g*(t)  Rn(t,f)  g(t)  dtdt ' 

—  00 


■  N  //  g*(t)  6(t-t ' )  g(t')  dtdt' 

—  00 
00 

“  N  J  J  g( t ) ] 2  dt  . 

—  00 


(3.2-31) 


5 


a 


A  figure  of  merit  used  to  judge  the  performance  of  the  matched 
filter  receiver  is  the  signal-to-noise  ratio  defined  as 


SNR 


e'I*chnI  1 


(3.2-32) 


E'IWlJ  1 


where  | ^rev  +  is  the  receiver  output  response  to  both  reverberation 

and  noise.  Because  it  has  been  assumed  that  the  noise  is  uncorrelated 
with  the  reverberation  response,  the  denominator  of  (3.2-32)  can  be 
written  as 


Efl  W*nl2)  '  EII1sevI2>  +  2  '•(e<*rev  V}]  +  e{I‘„|2) 

-  Etl  *tuEV!2>  +  E{|lnI2}  •  (3.2-33) 

The  cross-correlation  term  vanishes  in  (3.2-33)  because 


E{*REV  £n*}  “  //  8*(c>  E{yREV(t)  n*(t)}  8(t,)  dtdt’  (3.2-34) 


and  E(yREV(t)  n*(t)}  is  assumed  to  be  zero.  Thus,  the  SNR  in  (3.2-32) 
can  be  rewritten  as 


SNR 


E{iw2) 


E(|tR£v|2>  *  E(|tn|2) 


II  g*(t)  R  (t,t’)  g(t’)  dtdt ’ 
-•  yCHN 


.  (3.2-35) 


IJ  g*(t)  R  ( t ,  t  ’ )  g(t')  dtdt’  +  N  /  | g ( t ) |  dt 


REV 


Although  the  SNR  has  been  somewhat  simplified  in  (3.2-35)  both  the 

numerator  and  denominator  can  be  reformulated  in  terms  of  the  scattering 

functions  for  both  the  channel  scattering  and  reverberation.  Consider 

the  numerator  in  (3.2-35),  using  (2.4-33)  and  expressing  the  channel 

scattering  function  as  R-  (t,$),  the  expected  matched  filter  output  is 

^CHN 
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If  the  SNR  is  used  as  a  performance  measure  for  the  matched  filter 
receiver,  then  optimization  (increasing  the  SNR)  can  be  accomplished  by 
either  attempting  to  decrease  the  reverberation  response  without 
significantly  decreasing  the  channel  response  or  increasing  the  channel 
response  without  significantly  increasing  the  reverberation  response. 
Stutt^,  and  Spafford*6, 17 f  have  been  successful  in  optimizing  the  SNR 
for  a  point  scatterer  embedded  in  doubly  spread  reverberation  by  altering 
the  processing  waveform  so  as  to  minimize  the  denominator  of  (3.2-39). 

In  this  thesis,  the  numerator  will  be  increased  by  altering  the 
processing  signal.  This  will  imply  an  increase  in  SNR  if  the  denominator 
of  (3.2-39)  does  not  also  significantly  increase.  If  the  only 
interfering  signal  is  the  white  noise  process,  then  the  first  term  in 
the  denominator  is  zero  (no  reverberation  response).  In  this  case, 
increasing  the  numerator  guarantees  an  increase  in  SNR. 
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defined  as 


Er  =  Elf  | y ( t ) | 2  dt}  =  /  E{|y(t)|2}  dt. 


(3.2-42) 


Substituting  (3.2-41)  into  (3.2-42)  gives 


Er  3  Iff  J x( t-r) j  2  Rs(t,4>)  dTd<f>dt 


where 


If 


J  |x(t-T)|  dt 


Rg(T,()»)  d xd <f> 


=  Et  II  Rs (  T ,  )  dTd<t> 


(3.2-43) 


o  00 

E t  3  /  | x(  t— t ) |  dt  =  /  I x( t ) I  dt 


(3.2-44) 


is  the  system  input  energy  or  transmit  energy. 

At  this  point  several  statements  can  be  made.  First,  in  any 
practical  communication  channel  it  can  be  assumed  that  the  return  energy 
will  be  finite.  Second,  the  return  energy  can  be  expected  to  be  less 
than  that  of  the  transmitted  energy.  This  occurs  because  a  scatterer  can 
subtend  only  a  small  portion  of  the  transmitted  wavefront,  thus  it 
reflects  only  a  fraction  of  Et.  Furthermore,  some  of  the  transmitted 
energy  is  lost  due  to  absorption  loss  of  the  medium  and  spherical 
spreading  of  the  transmitted  and  reflected  wavefronts.  Therefore  it  can 
be  concluded  that 

-  E 

II  Rs(T,ct>)  dxd4>  =  -g1  <  1  .  (3.2-45) 

-CO  C 

Furthermore,  because  the  scattering  function  is  positive  definite 

00  00 

//  Rs(t,<|>)  did*  =  //  j Rs(  T,  d>) I  drd<|> 


(3.2-46) 
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and  so  ic  is  an  element  of  L*(R2). 

Scattering  functions  that  are  significant  over  a  region  of  the  (t,?) 
plane  are  said  to  describe  doubly-spread  or  doubly-dispersive 
communication  channels.  Such  scattering  functions  will  be  considered  in 
this  thesis,  and  in  Chapter  4  a  method  for  optimizing  a  receiver  to 
detect  a  signal  that  has  been  transmitted  through  a  doubly-spread 
communication  channel  is  presented. 


3.2.4  Properties  of  the  Cross-ambiguity  Function 
From  the  SNR  equation  (3.2-39)  it  is  easily  seen  that  the 
cross-ambiguity  function  in  part  determines  the  performance  of  the 
receiver.  Since  the  channel  and  reverberation  scattering  functions  are 
not  free  parameters,  optimization  of  the  receiver  response  to  a  signal 
can  only  be  done  by  altering  either  the  transmit  signal  x(t),  the 
processing  signal  g(t),  or  both.  Since  these  signals  are  related  to 
receiver  response  through  the  cross-ambiguity  function,  it  will  play  a 
major  role  in  the  optimization  procedure  developed  in  Chapter  4.  Since 
the  cross-ambiguity  function  is  significant  to  the  theory  developed  in 
this  thesis,  it  is  appropriate  to  discuss  some  of  its  properties. 

The  most  important  property  of  the  cross-ambiguity  function  is  that 
its  volume  is  finite.  To  prove  this  it  is  necessary  to  find  a  new  form 
of  the  uncertainty  function.  From  (3.2-37),  and  by  expressing  the 
waveforms  x(t)  and  g(t)  as  inverse  Fourier  transforms  gives 


X  (t,4>)  =  /  x(t-t/2)  g*(t+T/2)  el2**1  dt 

X  >  O  _ 


/  X(f)  eJ27Tf(c~T/2)  df 


//  x(f)  G*(n) 


/  e 

—  CO 


j2ir(f-n+<l>)t 


/  C(n)  e22,"'(t+T/2)  dn 
-j  2ir(f+n)  t/2 


dt 


ej2nt  dt 


dfdn.  (3.2-47) 


The  bracketed  term  in  the  integrand  of  (3.2-47)  can  be  replaced  by  a 
delta  function,  i.e., 


/"  eJ2.(£-n+*)t  dc  .  4(n^.„ 


(3.2-48) 


Replacing  (3.2-48)  in  (3.2-47)  gives 

00 

x  „(:,♦)  «  If  X(f)  G*(n)  <S( n— f — 4>)  t/2  dfdn 

8  _ao 

-  //  X(f)  G*(f+<|>)  e”j 2ir(f +$/2) t  df#  (3.2-49) 

—  CO 


Equation  (3.2-49)  now  expresses  the  uncertainty  function  in  a  useable 
form. 

The  volume  of  the  cross-ambiguity  function  can  now  be  found.  From 
the  definition  of  the  cross-ambiguity  function  in  (3.2-49),  the  volume  is 
given  by 


v  =  //  lx*  C  t  ,  <}) )  1  2  dxd<4>  =  //  X  a(T,$)  X*  (T,*)  did^ 

r* 

»  //  /  X(f)  G*(f+$)  e*j27r(f+$/2>T  df 

—  00 

/  x(n)  G*(n+4>)  e'*j2ir<ri+(t,/2>T  dn 


=  JfJ  X(f)  x*(n)  G*(f+$)  G(n+$) 


/  e 
—  00 


did<{> 

-j  2ir(f-n)  t 


dr 


dndfd<j>* 

(3.2-50) 


As  in  (3.2-47),  the  bracketed  term  in  the  (3.2-50)  can  be  replaced  by  an 
impulse  function  thus  reducing  (3.2-50)  to 
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CO 

V  =  ///  X(f)  x*(n)  G*(f+<|>)  G(n+$)  S(f-n)  dndfd* 

—co 


=  //  X(f)  X*(f)  G*(f+<$>)  G(£+<p)  dfd<p 

—CO 

00 

=  //  I  X(f  )  j  2  I  G(f+d>)  I  2  dfd*. 


(3.2-51) 


-  E  E  <  «  .  (3.2-54) 

g  x 

Thus,  Che  volume  of  the  cross-ambiguity  function  is  finite  and  is  equal 
co  the  product  of  the  energies  of  the  transmit  and  processing  waveforms. 

By  convention,  the  value  of  the  processing  signal  is  dimensionless 
as  opposed  to  the  transmit  signal  which  may  have  units  of  either  volts  or 
amperes.  Therefore,  Eg  is  a  dimensionless  number  and  the  volume  of  the 
cross-ambiguity  function  is  in  units  of  energy. 
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Another  property  of  the  cross-ambiguity  function  is  that  it  is 
bounded.  This  can  be  shown  by  applying  the  Schwartz  inequality,  i.e., 


00  ‘ 
)C.  cr(T.4,)|2  =  /  x( t-i/2)  g*(t+T/2)  e^27T<*>t  dt 

X,g  -CO 


<  /  |x(t-r/2)|2  dt  /  | g(t+x/2) | 2  dt  .  (3.2-55) 


By  definition,  the  integrals  on  the  right  side  of  the  inequality  in 
(3.2-55)  are  the  energies  of  the  transmit  and  processing  signals,  so 

lVg(T,*)|2  <EXV“.  (3.2-56) 

Thus  the  cross-ambiguity  function  is  bounded. 


3.3  Principles  of  Matched  Filter  Receiver  Optimization 

From  this  point  forward  it  will  be  assumed  that  the  transmit  signal 


x(t)  and  processing  signal  g(t)  are  unit  energy  waveforms,  i.e., 


/  |x(t)j  2  dt  =  1, 
—  00 

/  lg(t) 1 2  dt  =  1. 


(3.3-1) 

(3.3-2) 


These  assumptions  will  cause  no  loss  of  generality  in  any  of  the  theory 
developed  in  this  thesis.  A  consequence  of  (3.3-1)  and  (3.3-2)  is 
00 

//  |x  (^$)|2  dTd*  =  E  E  =  1  (3.3-3) 

-CO  °  x  S 

and 

|Xx,g(T,»)52  <EX  Eg=  1.  (3.3-4) 

Thus,  the  cross-ambiguity  function  has  unity  volume  and  is  bounded  by 


one . 


1  V  'V 
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J(g+n)  =  J(g)  +  6J(g,n)  +  j  62J(g,n)  +  ... 

+  4  «(m4g,n)  +  ... 

D1  i 

where 


(4.3-21) 


5^m^J(g,n)  =  lira  J(g+en) 
e+0  dem 

and  is  referred  to  as  the  m-th  Gateaux  derivative  of  J(g)  at 
increment  n(t).  In  section  4.3.1,  it  was  shown  that  J(g+en) 
degree  polynomial  in  £  (see  equation  (4.3-5));  therefore,  by 
of  (4.3-22)  all  Gateaux  derivatives  of  order  three  or  higher 
to  zero.  Thus, 


(4.3-22) 

g(t)  with 
is  a  second 
examination 
are  equal 


J(g+n)  =  J(g)  +  6J(g,n)  +  j  62J(g,n)  +  ... 

oo 

=  J(g)  +  2  Re {/  n(t)  $*(t)  dt)  +  J(n),  (4.3-23) 

—  00 


by  (4.3-8),  (4.3-19),  and  (4.3-21).  It  is  fortunate  that  the  Taylor 
series  can  be  truncated  to  only  three  terras  since  this  will  allow  for  an 
accurate  analysis  of  the  optimization  procedure  developed  later. 

4.3.4  Properties  of  the  Gradient  Function 
In  section  4.3.1,  the  gradient  function,  given  by 
00 

*(t)  =  If  X*  (  T,#)  R-(T,*)  X(t-T)  ej2^(c-T/2)  dTd$,  (4.3-9) 
x  >  a  3 

arose  from  rearranging  the  integral  that  defined  the  first  Gateaux 
differential  of  the  cost  functional  J(g).  As  with  the  cross-ambiguity 
function,  since  the  gradient  function  is  significant  to  the  theory 
developed  in  this  thesis,  it  is  appropriate  to  discuss  some  of  its 


properties. 
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Therefore , 


lim 
II  0  H  2  -K3 


|52J(g,n,8)-d2J(g,n,8)| 

il  S  H  2 


lim 
H  0 II 2  -*0 


0 

II S II 2 


=  0. 


(4.3-17) 


Thus,  (4.3-15)  holds  for  5J(g,n)  as  given  by  (4.3-6)  so  J(g)  is  twice 
Frechet  differentiable  and  6J(g,n,S)  describes  a  unique  and  continuous 
linear  mapping  of  all  8(t)  in  L^(R^)  to  R^. 

For  the  purposes  of  analyzing  the  behavior  of  J(g)  as  g(t)  changes 
to  g(t)  +  8(t),  the  second  Frechet  derivative  must  be  evaluated  with 
n(t )  »  8(t).  In  this  case,  the  second  Frechet  derivative  is  written  as 

62J(g,n,n)  =  62J(g,n).  (4.3-18) 


From  (4.3-18)  it  follows  that  for  the  cost  functional  given  by  (4.1-1), 
the  second  Frechet  differential  is 

OO 

52J(g,n)  =  2  //  |x  2  Rg(  t,  <}>)  drd<|>  =  2  J(n).  (4.3-19) 

Note  that  because  the  second  Frechet  differential  is  proportional  to  the 
cost  functional  evaluated  at  n(t),  the  cost  functional  itself  is  a 
continuous  mapping. 


4.3.3  The  Generalized  Taylor  Expansion 

It  is  well  known  from  elementary  calculus  that  the  function  of  a 
real  variable,  f(x),  can  be  written  as  a  Taylor  series,  i.e., 

f(x+Ax)  =  f(x)  +  f'(x)  Ax  +  y  f '  ’(x)  (Ax)2  +  ... 

+  -j-  f(m)(x)  (Ax)m  + -  (4.3-20) 

m : 

It  can  be  shown  that  the  Taylor  series  can  be  generalized  for  functionals 
that  map  one  linear  space  to  another ^ , 20.  jn  ^is  case  che  Taylor 
expansion  for  the  cost  functional  is  expressed  by 


5J(g,n,3)  =  lim 


e+0 


=  lim 

e+0 


IT  2  Re{ll  Xx,g+e6(T’<J,)  *x,  n(x>  <*>)  RS(t’*}  dTd*} 

oo 

2  Re  {//  ^  CT( T,  4>)  x*  n(x,<j>)  Rg( t,  <t>)  dTd4>} 

00 

+  2  £  Re  { //  ^  g(x,<t>)  X*  n(x,<|>)  Rs  (  T ,  4> )  dxd1>} 

00 

-  2  Ret//  Xx  g(x,<f>)  x£  n(T,  ♦)  Rs(t,4>)  did*}.  (4.3-14) 

By  its  definition,  the  second  Gateaux  derivative  is  just  the  first 
Gateaux  derivative  of  5J(g , n)  with  respect  to  g(t).  In  effect,  6J(g,n) 
has  been  treated  no  differently  than  any  other  functional  whose  first 
Gateaux  derivative  has  been  sought.  Thus,  <s2j(g,n,3)  gives  the  first 
order  change  in  5J(g,n)  when  g(t)  changes  to  g(t)+g(t). 

As  with  the  first  Gateaux  derivative,  <$^J(g,n,g)  describes  a 
continuous  and  unique  mapping  from  L^R^)  0  L,2(r2)  if  it  is  also  the 
Frechet  derivative  of  <5J(g,n)  with  respect  to  g(t).  Equation  (4.3-13) 
describes  a  second  Frechet  derivative  if  5J(g,n)  is  a  Frechet  derivative 
of  J(g)  and  if  for  a  fixed  g(t)  in  L^(R^)  H  1.2(r1),  and  3(t)  an  arbitrary 
element  in  L.2(r1),  then 


1 1  m  I 6J(g+8, n)-6J(g, n)-62j(g, n, g) i  =  n 
"eil2 


(4.3-15) 


Without  loss  of  generality,  the  increment  3(t)  can  be  replaced  by  ez(t) 
where  e  is  a  real  constant  and  II z II 2  3  !•  The  differential  of  the  two 
first  Frechet  derivatives  of  J(g)  in  the  numerator  of  (4.3-15)  can  be 
found  from  (4.3-14)  and  (4.3-6)  to  be 

S-Kg+S.n)  -  5J(g,n) 

00 

3  2  Re {//  Xx  g( T ,  4>)  X*x  n(T,4>)  R$(  x,  q> )  dxd*t>} 

—  00  *  * 

3  <52J(g,n,6). 


(4.3-16) 
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lim 
e>0 

=  lim  |  e  |  JJ  |x  z(T,<j>)|2  R  dTd$  =  0.  (4.3-12) 

This  shows  chat  <SJ(g,n)  is  a  FrecheC  derivative  and  that  it  is  a 
unique  and  continuous  mapping  from  the  set  of  all  functions  in 
LkR1)  D  L^R1)  to  R1. 

4.3.2  The  Second  Gateaux  Derivative 

Although  the  second  Gateaux  derivative  will  not  be  used  in  the 
development  of  the  optimization  procedure  presented  later,  it  will  be 
useful  in  the  analysis  of  its  performance  and  convergence.  Furthermore, 
the  definition  ultimately  used  does  not  express  the  second  Gateaux 
derivative  in  its  most  general  form;  however,  it  will  be  suitable  for 
analyzing  the  behavior  of  the  cost  functional. 

The  second  Gateaux  derivative  of  the  cost  functional  J(g)  with 
increments  n(t)  and  8(t),  where  g(t)  is  in  L^(R^)  and  l2(r1),  and  n(t) 
and  B(t)  are  arbitrary  elements  of  L^(r1)  is  given  by ^,20 

62J(g,  n,  8)  =  lim  5J(g+eS,n).  (4.3-13) 

z+o 

In  section  4.3.1  the  first  Gateaux  derivative  was  expressed  by  (4.3-6), 
thus,  by  applying  the  definition  in  (4.3-13),  and  using  the  property  of 
the  cross-ambiguity  function  given  by  (4.3-3),  the  second  Gateaux 
derivative  is 


J(x+ez)-J(x)-<SJ(g,  ez) 
Uezl!„ 


» 

1 


5 
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6J(g,n)  =  2  Re  {/  n(t)**(t)  dt}  =  2  Re{<n,$>}  (4.3-8) 

—  oo 

where  $(t)  is  called  Che  'gradient  function'  defined  as 

$(t)  =  //  K*  (*,$)  RQ(T,<t>)  x(t-T)  e^21^1"  T^2')  did <{>.  (4.3-9) 

The  properties  of  the  gradient  function  will  be  discussed  in  section 

4.3.4. 

The  Gateaux  derivative  of  J(g)  as  given  in  (4.3-6)  and  (4.3-8) 
defines  a  linear  transformation  (mapping)  from  the  set  of  all  functions 
in  L,1(r1  )  O  L2(RM  to  R1.  However,  from  its  definition,  the  existence  of 
a  Gateaux  derivative  does  not  imply  that  it  is  a  continuous  or  a  unique 
mapping  of  one  linear  space  to  another.  This  occurs  because  the  norm  of 
the  space  L2(R^)  is  not  involved  in  the  definition  given  by  (4.3-1). 
Continuity  and  uniqueness  can  only  be  guaranteed  by  a  Frechet  derivative. 

By  definition,  the  Gateaux  derivative  <5J(g,n)  is  also  a  Frechet 
derivative2*  19> 20  if  for  a  fixed  g(t)  in  L^R1)  and  L2(R1)  and  an 
arbitary  n(t)  in  L^R1),  then 

lim  1 J ( x+n ) - J(_x ) - <S J( g ,_n)J_  =  0  #  (4.3-10) 

ii  n  n  2  llnil2 

Without  loss  of  generality,  the  increment  n(t)  can  be  replaced  by  ez(t) 

where  £  is  a  real  constant  and  II z II 2  =  The  numerator  of  (4.3-10)  can 

be  found  from  (4.1-1),  (4,3-5),  and  (4.3-6),  i.e. 

J(g+ez)  -  J(g)  -  5J(g,en) 

00 

=  e2  //  lx*  Z(T,«)  |  RS(T,*)  did*. 

—  OO 

Applying  (4.3-11)  to  (4.3-10)  gives 


(4.3-11) 


J(g+en) 


(4.3-5) 


which  is  a  second  degree  polynomial  in  e.  Evaluating  the  derivative  of 
(4.3-5)  with  respect  to  e  and  taking  the  limit  as  e  0  gives 
the  first  Gateaux  differential  of  the  cost  functional  J(g)  as 


00 

6J(g,n)  -  2  //  8e{xXfg(T,*)  x£>n(T.*)>  Rs(T.<tO  dtd* 

00 

=  2  Re  {//  <t(t»1))  x£>n('T>!f>)  Rs(t,<)>)  d td (j) } .  (4.3-6) 


Equation  (4.3-6)  can  be  rewritten  into  a  form  that  will  be 
convenient  later.  This  is  done  by  replacing  the  conjugated  uncertainty 
function  in  the  integrand  by  its  definition  in  (3.2-37),  changing 
variable  of  integration,  and  rearranging  terms  as  follows: 


5J(g,n) 

00 

*  2  Re {//  Xx  (t,$) 
A  >  s 


/  x(t-x/2)  n*(t+x/2)  e-'  11  ^  dt 


Rg(T,4>)  d Td  } 


=  2  Ret//  (t,  ♦) 

_co  X>  S 


2  Ret/  n(t) 


/  x* ( t — x )  n(t)e'j27T:!>(,:“T/2)  dt 
—  00 

//  x*  (*»♦)  RS(T,*)  x* ( t — t )  e-^‘-T/2)  d 


Rg  (  x ,  ij) )  dxd  4>} 


xd<J> 


dt. 


(4.3-7) 


Equation  (4.3-7)  can  be  more  compactly  written  as 
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second  Gateaux  derivatives  of  the  cost  functional.  They  are  often  given 
Che  interpretation  of  being  generalized  directional  derivatives  over  a 
linear  space,  and  in  the  engineering  literature  these  derivatives  are 
referred  to  as  the  first  and  second  variation. 

Let  g(t)  be  an  element  of  l1(r1)  and  l2(R_1)  and  n(t)  be  an  arbitrary 
element  of  l2(R.1),  then 


6j(g,n)  =  lim  J(g+en) 
e-*o 


(4.3-1) 


is  the  first  Gateaux  differential  of  J(g)  with  increment  n(t)2» 19,20.  To 
start,  it  is  necessary  to  find  the  derivative  on  the  right  side  of 
(4.3-1),  therefore, 

J(g+eTl)  JJ  lxXfg+en(x,*)|2  Rs(t,4>)  dTd*  *  <4*3-2) 

The  integral  in  (4.3-2)  can  be  expanded  as  a  polynomial  in  e.  From  the 
definition  of  the  uncertainty  function  by  (3.2-37), 


x  T»^  =  /  x(t— T/2)  tg(t+x/2)  +  en(t+t/2]*  e 

x  >  *•  -00 

00  •  o 

*  /  x(t-T/2)  g*(t+T/2)  ej2ir4,t  dt 


j2ir<J>t 


+  e  /  x(t-t/2)  n*(t+f/2)  e 

—  GO 

■  +  c  *x,n(T'4)' 


j  2n  4>t 


(4.3-3) 


It  follows  that  the  cross-ambiguity  function  derived  from  (4.3-3)  is 


x,g+en 


(t,$)|2  58  \\  g(f,$)|2  +  2  Re  txx>g( T,  *)  x£jn(T,$)}  e 


(4.3-4) 


Substituting  (4.3-4)  into  the  integral  of  (4.3-2)  gives 


By  Theorem  4.1,  the  supremum  on  the  right  side  is  uniquely  known,  giving 


inf  llx-yll^  =  II x II ?  +  1-2  U x It _ -  (4.2.8) 

y(t)€s  2  2  2 

where  y(t)  =  x(t)/llxll2«  The  right  side  of  (4.2-8)  is  a  quadratic 
equation  of  the  norm,  and  it  can  be  easily  verified  that  its  value  is 
always  greater  than  or  equal  to  zero.  Thus,  the  unique  best  approxima¬ 
tion  to  any  x(t)  with  a  non-zero  norm  by  an  element  of  the  set  S  defined 
by  (4.2-5)  is  y0(t)  *  x(t )/ llxll 2*  This  completes  the  proof  of  the 
theorem. 

Although  Theorem  4.2  is  stated  in  terms  of  L^(r1),  it  can  extend  to 
other  Hilbert  spaces.  This  can  be  illustrated  by  considering  a  Hilbert 
space  familiar  to  nearly  everyone;  the  set  of  all  two-dimensional  vectors 
R.2  (the  real  plane).  Figure  10  shows  a  portion  of  R^  near  the  origin  and 
the  set  S,  which  in  this  case  is  the  unit  circle.  Also  shown  is  the 
vector  x  and  its  best  approximation  in  the  set  S,  the  vector  yQ.  It  is 
intuitively  clear  that  the  best  approximation  is  mearly  a  scaled  version 
of  x  that  has  unity  length.  Other  vectors  in  S  are  shown,  and  it  can  be 
seen  that  the  norm  of  the  difference  vectors  between  them  and  the  vector 
x  are  not  minimal. 

4. 3  Gateaux  Derivatives  of  the  Expected  Matched  Filter  Output 

4.3.1  The  First  Gateaux  Derivative 

As  mentioned  in  the  introduction  to  this  chapter,  the  cost 
functional  used  in  the  optimization  of  the  matched  filter  receiver  is  its 
expected  output  subject  to  the  constraint  that  the  value  of  the  norm  of 
both  the  processing  signal  and  transmit  signal  is  unity.  To  develop  and 
analyze  the  optimization  procedure,  it  is  necessary  to  find  the  first  and 


62 


Proof  By  the  properties  of  the  complex  numbers  and  the  Schwartz 
inequality 

Re<x,y>  <  | <x,y>|  <  lx»2  a y H 2 .  (4.2-3) 

The  right  inequality  of  (4.2-3)  is  an  equality  if  and  only  if 
y(t)  =  X  x(t)  where  X  £  R^.  Because  y(t)  is  an  element  of  S,  either 
X  *  1  /  H x II 2  or  X  =  —  1  /  II x II 2 »  causing  <x,y>  to  be  a  real  quantity,  thus 

Re{<x,y>}  -  ±  <x,y>  =  ±  11  x U 2 *  (4.2-4) 

Therefore,  the  left  inequality  of  (4.2-3)  is  an  equality  and  Re{<x,y>)  is 
maximized  only  when  X  =  1/ llxll 2*  Equation  (4.2-2)  follows  immediately. 

Theorem  4.1  is  used  to  prove  the  following  theorem. 

Theorem  4.2  Let  S  be  the  set  of  all  functions  in  L^(R^)  whose  norm  is 
unity.  For  any  non-zero  element  x(t)  in  l2(rI),  the  unique  best 
approximation  by  an  element  y0(t)  in  S  is  given  by  y0(t)  -  x(t)  /  Hxll2- 

Proof  Finding  the  best  approximation  is  defined  as  a  minimum  norm 
problem;  in  other  words,  an  element  yQ(t)  is  sought  such  that 

llx-y  Up  =  inf  llx-yllp  >  0.  (4.2-5) 

°  y(t)  £  s 

In  the  Hilbert  space  L^(R^)  the  norm  can  be  defined  in  terras  of  the  inner 
product,  thus  (4.2-5)  can  be  restated  as 

inf  llx-yllp  =  inf  <x-y,x-y> 

y(t)  £  S  z  y(t)  £  S 

=  inf  {»xllp  +  ll y ll ?  -  2  Re{<x,y>}}.  (4.2-6) 

y(t)£  s 

2 

Since  y  £  S  then  U y II ^  =*  1,  therefore, 

inf  Hx-yll?  ■  lx  I?  +  1  -  2  sup  Re{<x,y>}. 

y(t)  £  S  y(t)  €  s 


(4.2-7) 
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In  section  4.2  the  set  of  unit  energy  signals  is  discussed,  and  a 
theorem  is  stated  and  proved  giving  the  best  approximation  of  any  element 
in  the  space  l2(r!)  by  a  unit  energy  signal.  In  section  4.3,  the  first 
and  second  Gateaux  derivatives  (also  known  as  first  and  second  variations) 
of  J(g)  are  derived  as  well  as  its  generalized  Taylor  expansion,  and  in 
section  4.4  the  procedure  for  optimizing  J(g)  subject  to  the  constraints 
(4.1-2)  and  (4.1-3)  is  presented.  Finally,  in  section  4.5  several 
numerical  examples  of  optimizing  a  matched  filter  receiver  using  this 
procedure  are  given. 

4.2  The  Set  of  Unit  Energy  Signals 

As  indicated  in  Chapter  1,  all  functions  used  to  represent  signals 
will  be  elements  of  L^(r1)  and  L^(R^).  Furthermore,  it  was  established 
in  section  3.3  that  the  energies  of  both  the  transmit  and  processing 
signals  are  unity.  Thus,  it  follows  that  the  set  of  all  functions  in 
L.2(rI)  whose  norm  is  equal  to  unity  is  of  considerable  importance  to  the 
development  of  the  theory  presented  in  this  chapter.  Two  theorems  are 
stated  and  proved  below.  The  second  theorem  will  be  applied  in  the 
development  of  the  optimization  procedure  developed  in  Chapter  4.4. 

Theorem  4. I  Let  x(t)  and  y(t)  be  non-zero  elements  of  the  complex 
Hilbert  space  l2(r1).  Let  y(t)  be  an  element  of  the  set  of  all  functions 
in  L2(Rl)  with  unit  norm  (unit  energy),  i.e., 

S  =  (y(t)€  L2(RL)  |  II y 1 2  =  1}.  (4.2-1) 

If  x(t)  is  fixed  then  the  value  of  Re{<x,y>}  is  maximized  if  and  only  if 
y(t)  *  x( t ) / 13 xll 2  giving 

sup  Re(<x,y>}  =  II  x  H  ~ . 

y(t)  e  s 


(4.2-2) 


CHAPTER  4 


OPTIMAL  SIGNAL  DETECTION 

4. 1  Introduction 

In  this  chapter  an  iterative  optimization  procedure  is  developed  to 
increase  the  expected  matched  filter  receiver  response  to  a  signal  that 
has  been  transmitted  through  a  communication  channel  whose  average 
scattering  properties  are  known.  As  might  be  expected,  the  cost 
functional  used  in  this  procedure  is 

00 

J(g)  =  E{|£|2}  -  //  R„(T,4>)  |  Y  (T,$)|2  dTd<j>,  (4.1-1) 

-00  a 

where  an  increase  in  the  value  of  J(g)  is  sought  subject  to  the 
constraints 

00 

H x U 2  ■  /  I x( t ) I  2  dt  •  1,  (4.1-2) 

—00 

oo 

H g N 2  3  /  |g(t)|  dt  =  1.  (4.1-3) 

—  00 

In  other  words,  both  the  transmit  and  processing  signals  have  unit 
energy.  The  cost  functional  is  defined  for  all  x(t)  and  g(t)  in  L^(R^), 
and  this  can  be  shown  by  examining  the  integrand  of  (4.1-1).  Because 
x(t)  and  g(t)  have  finite  energy,  the  value  of  the  cross-ambiguity 
function  is  bounded  for  all  t  and  <$>  by  (3.2-56).  Furthermore,  it  was 
shown  in  section  3.2.3  that  the  scattering  function  has  finite  volume, 
thus 

00 

J(g)  £  Ixll 2  Sg«2  Jl  RS(T,$)  did*  <  »,  (4.1-4) 

—  00 

showing  J(g)  exists  for  all  x(t)  and  g(t)  in  L^(R^).  In  fact,  it  means 
that  J(g)  is  a  bounded  functional  over  the  space  L^(R^). 
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function  is  bounded  by  unity  the  constant  K  is  constrained  by 

K  <  W  r‘  (,,<,)  -  (3.3-9) 

RLn  r2  schn 

thereby  causing  the  right  side  of  (3.3-8)  to  be  less  than  or  equal  to 
one  for  all  ( t , <}> )  £  R^O  R£. 

Equations  (3.3-8)  and  (3.3-9)  imply  that  if  the  cross-ambiguity  and 
scattering  functions  are  constrained  to  be  non  zero  in  the  regions  R^  and 
R2,  respectively,  then  the  expected  value  of  the  matched  filter  receiver 
response  to  the  channel  output  is  maximized  if  the  cross-ambiguity 
function  has  the  same  shape  as  the  channel  scattering  function  in  their 
region  of  intersection  in  the  ( x ,  4> )  plane.  If  the  regions  R^  and  R2  have 
finite  areas  of  nonintersection,  then  some  of  the  volume  of  the 
cross-ambiguity  and  channel  scattering  functions  are  lost  when  their 
product,  the  integrand  of  (3.2-38)  is  formed.  This  implies  that  an 
improvement  in  the  matched  filter  response  can  occur  if  the  cross 
ambiguity  function  and  the  scattering  function  subtend  the  same  region  in 
the  ( x ,  (p )  plane.  Since  the  channel  scattering  function  is  not  a  free 
parameter,  improvement  in  receiver  performance  can  only  be  attained  by 
altering  the  shape  of  the  cross-ambiguity  function  which  can  be  done  by 
changing  either  the  transmit  signal,  the  processing  signal,  or  both.  A 
method  for  optimizing  the  matched  filter  output  by  altering  the 
processing  signal  is  presented  in  Chapter  4. 
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Consider  the  expected  value  of  the  matched  filter  receiver  output  to 
a  received  signal  which  was  shown  in  section  3.2.2  to  be  given  by 

00 

E{!*CHn!2}  =  //  I  Xx>g<T»'*>)l  2  RSCHN(T’^  dTd<!>-  (3.2-38) 

If  R-i  is  defined  to  be  the  region  of  the  ( t,  <J>)  plane  where  the  cross¬ 
ambiguity  function  is  non-zero,  and  R2  is  defined  to  be  the  region  where 
the  scattering  function  is  non-zero,  then  (3.2-38)  can  be  restated  as 

e{UCHN|2>  -  //  l*x  Rs  (*>$)  dTd*‘  (3‘3-5) 

By  the  Schwartz  inequality  an  upper  bound  for  (3,3-5)  can  be  found, 


e{|‘chk|2>  ■R//R2|xx,g<T^)|2WT’*)  dTd* 

00  00 

<  //  |x  „(*,*)( 2  dtdtj>  //  R  ( t,  <J>)  dtd* 

—  GO  X»S  _oo  SCUN 

Er 

-  f -  Er  <  1.  (3.3-6) 

x 


From  the  Schwartz  inequality  the  matched  filter  receiver  output  can 
attain  its  bound  if  and  only  if 

|xx>g(T,<|))|2  =  K  rSchn(t,*)  for  (t,*)  R1OR2,  (3.3-8) 

where  K  is  a  positive  real  constant.  Because  the  cross-ambiguity 
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Property  1:  The  gradient  function  <&(t)  is  a  bounded  function  if  the 
transmit  signal  x(t)  is  also  bounded,  i.e.,  |x(t)|  K  for  all  t  R^. 

This  can  be  shown  by  first  finding  the  magnitude  of  <t(t)  and 
applying  the  Schwartz  inequality  to  (4.3-9)  as  follows: 

oo 

l<Kt)|  <  II  lx*  cr(T»|t>)|  MT,<I>)  |x(t-t)|  dTd^ 

-oo  X,S  3 

oo 

<  K  //  lx-.  a(t,*)|  Ro(  t,  <(>)  dxd*.  (4.3-24) 

— <x>  ^>6 

Furthermore,  because  the  scattering  function  has  finite  volume,  and 
|Xx,g(T,$)|  <  1  by  (3.3-4), 

OO 

|*(t)|  <  K  If  Rg( T, <}>)  dxd4>  <  »,  (4.3-25) 

—  OO 

so  4(t)  is  a  bounded  function. 


Property  2:  The  gradient  function  $(t)  is  an  element  of  L^(R^)  and 
L2(RL). 

Showing  that  $(t)  £  L^R^)  can  be  done  by  using  (4.3-24), 


I  |$(t)|  dt  <  I 


IS  I  ^  ^  g  C  T ,  <}> )  I  Rg(  x,  <t>)  jx(t-x)|  dxd<(> 


I  | x( t  —  x ) |  dt 


-  H  )x*ig<T.*)l  Rg^1^ 

Since  x(t)  is  in  L^(R^)  and  L2(R*)  and  | Xx,g( T> I  <  1, 

OO  00 

I  |*(t)|  dt  <  Hxllj^  II  Rg  (  x ,  <J>)  d  xd  <  ■». 


d  xd  (p. 


dt 


(4.3-26) 


(4.3-27) 


Hence,  $(t)  is  magnitude  integrable,  and  it  is  an  element  of  L^(R^). 
Also,  because  it  is  bounded,  by  Theorem  2.7  it  is  also  an  element  of 
L2(R1). 


Property  3:  The  inner  product  of  the  processing  signal  g(t)  and  the 
gradient  function  <i(t)  is  equal  to  the  value  of  the  cost  functional, 


73 


i  •  e  • , 


<g,<J>>  =  /  gU)  **(t)  dt  =  J(g)  . 


(4.3-23) 


This  is  shown  by  expanding  the  left  side  of  (4.3-28)  using  the 
definition  of  the  gradient  function  in  (4.3-9)  and  rearranging  the 
integrand  as  follows: 


<g,4>  =  /  g(t) 


//  y  _(*,♦)  Rs(t,<^)  x*(t-t)  e~j  2-rrq>(t:-T/2) 
-00  ® 


dt 


//  X_  _<*.♦)  Rs ( t , <J> ) 

—  CO  *  ® 

00 

//  Xjj^gCT,*)  Rs(^,4>) 


/  x*(t-T)  g(t)  e-J2w*(t-T/2)  dt 


/  x(t-t/2)  g*(t+t/2)  e^  7T^>t  dt 


did  <}) 

d  xd  d>. 
(4.3-29) 


The  bracketed  term  in  the  integrand  of  (4.3-29)  is  equal  to  the 
uncertainty  function  Xx, g(T» $) »  so  equation  can  be  rewritten  as 

00 

<g,$>  =  II  Ix_  ,,(t,4>)|2  Mt,4>)  clxd q>.  (4.3-30) 

But  the  right  side  of  (4.3-30)  is  the  cost  functional,  thus  <g,$>  = 

J(g). 

By  itself,  this  property  is  of  little  consequence.  It  is,  however, 
useful  in  establishing  the  next,  and  final,  property  of  the  gradient 
function. 


Property  4:  The  norm  of  the  gradient  function  4>(t)  is  greater  than  or 
equal  to  the  value  of  the  cost  functional. 


Because  both  g(t)  and  <5(t)  are  elements  of  L^(R^)  and  L.2(r1)  ;hey  • 
both  have  finite  energies,  in  fact,  li g U 2  =  1  by  convention.  Therefore, 


B 

a 


by  applying  the  Schwartz  inequality  to  (4.3-28)  gives 

J(g)  =  |j(g)|  =  ( <g,  <£•>[  <  llg«2  **I2  =  \\nr  (4.3-31) 

4.4  Optimization  of  the  Expected  Matched  Filter  Output 
4.4.1  The  Increment  of  the  Cost  Functional 

In  section  4.3.3  it  was  shown  that  the  cost  functional  could  be 
expanded  into  the  generalized  Taylor  series 

CD 

j(g+n)  =  J(g)  +  2  Re {/  n(t)  $*(t)  dt}  +  J(n),  (4.3-23) 

—  00 

where  g(t)  is  the  processing  signal  and  an  element  of  L^-(R^)  and  L^(r1), 
n(t)  is  an  arbitrary  element  of  L^(R^),  and  $(t)  is  the  gradient 
function  given  by 

00 

*(t)  -  //  X*  (*,♦)  Rs(  T,  ,J>)  X(t-T)  ej27T*(t~T/2)  dTd*.  (4.3-9) 

Consider  the  increment  of  the  cost  functional  J(g)  with  increment  n(t), 
defined  as 

AJ(g,n)  *  J(g+n)  -  J(g) 

00 

=  2  Re {/  n(t)  **(t)  dt)  +  J(n).  (4.4-1) 

—  OO 

Since  J(n)  is  always  a  non-negative  number,  if  the  increment  AJ(g,n)  is 
to  be  positive,  then  it  is  necessary  to  choose  n(t)  in  such  a  fashion  to 
guarantee  that  the  first  Gateaux  derivative,  the  integral  term  in 
(4.4-1),  is  positive.  This  can  be  done  by  choosing  the  increment  n(t) 
to  be 


h(t)  =  Y  <t>(t) 


(4.4-2) 


where  p  is  a  positive  real  number.  Using  this  rule  causes  the  increment 
of  the  cost  functional  to  become 


AJ(g, n)  =2  y  J  |$(t)|2  dt  +  J(y«)  =  2  Y  »<&«2  +  J(Y*)  >  0  .  (4.4-3) 

Notice  that  by  the  Schwartz  inequality,  choosing  the  increment  using 
(4.4-2)  maximizes  the  value  of  the  first  Gateaux  derivative. 

The  choice  of  the  name  'gradient  function'  for  the  function  <5(t)  can 
now  be  easily  explained.  Consider  the  analogy  of  being  at  some  point  on 
a  surface  which  can  be  modeled  as  a  function  of  two  real  variables 
f(x,y).  If  one  wants  to  move  in  the  direction  that  will  cause  the 
greatest  positive  change  in  elevation,  then  one  moves  in  the  direction  of 
the  gradient  vector  given  by 

Vf(x,y)  =  (x,y)  l  +  -|£  (x,y)  j.  (4.4-4) 

By  choosing  the  direction  vector  of  travel  to  be  v  =  Y  Vf(x,y)  where  Y  is 
a  positive  real  scalar  will,  to  first  order,  guarantee  a  positive 
increase  in  elevation,  denoted  by  h,  which  is  approximately 

Ah  =  Y  IIVf(x,y)ll2  (4.4-5) 

where  II  •  II  is  the  vector  norm  given  by  the  Pythagorean  theorem.  Notice 
the  similarity  between  (4.4-4)  and  the  first  term  on  the  right  side  of 
(4.4-3). 

Equation  (4.4-4)  essentially  extends  from  the  concept  of  a 
directional  derivative.  In  the  general  case  the  value  of  the  change  in 
elevation  approximated  by 

Ah  =  <yn,  Vf(x,y)>  =  y  <n,7f(x,y)>,  (4.4-6) 

where  n  is  a  direction  vector  with  unity  length  and  y  is  the  horizontal 
length  of  travel.  This  relates  well  to  the  first  Gateaux  derivative 
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(first  variation)  <5J(g,n),  which  is  to  first  order  an  estimate  of  the 
change  of  the  value  of  the  cost  functional  when  the  function  g(t)  changes 
to  g(t)  +  n(t).  In  this  case,  if  n(t)  =  Y  z(t)  then 

00 

<SJ(g,Yz(t))  =  2  Re{/  Y  z(t)  $>*(t)  dt}  3  2  y  Re{<z,$>}.  (4.4-7) 

—  CO 

Since  both  (4.4-6)  and  (4.4-7)  are  of  a  similar  form  it  is  natural  to 
refer  to  g(t)  as  the  'gradient  function.' 


4.4.2  The  Gradient  Projection  Algorithm 

In  the  last  section,  it  was  shown  that  if  the  increment  n(t)  was  a 
scaled  replica  of  the  gradient  function  <f(t),  then  the  increment  of  the 
cost  functional  is  positive.  If  this  approach  is  used  to  choose  a  new 
processing  signal  g(t)  by  letting  g(t)  3  g(t)  +  n(t),  then  it  can  be  seen 
immediately  that  there  is  no  guarantee  that  this  processing  signal  has 
unit  energy.  Therefore,  the  increment  n(t)  must  be  chosen  not  only  to 
insure  a  positive  change  in  the  cost  functional  but  also  guarantee  that 
the  new  processing  signal  has  a  unity  norm.  In  other  words,  g(t)  *  g(t) 

+  n(t )  is  an  element  of  the  set  S  where 


S  3  (y  £  L2(Rl)  |  B y II 2  =  1}.  (4.2-1) 

One  way  to  do  this  is  to  set  n(t)  3  Y  $(t),  giving  a  new  processing 
signal  equal  to  g(t)  +  Y  <Kt)  and  then  projecting  this  function  onto  the 
set  S  by  using  its  best  approximation  in  S  which  by  Theorem  4.2  is 


g(t) 


g(t)  +  Y  $(t) 
Ilg+Y't’H  2 


(4.4-8) 


The  actual  increment,  denoted  as  5g(t),  in  this  case  is 


5g(t)  3  g(t)  -  g(t)  3  g(C'iig+T^||^(t)  -  g<t). 


(4.4-9) 
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Figure  11  illustrates  the  process  of  choosing  a  new  processing 
signal  if  the  analogy  of  signals  represented  by  vectors  is  used. 

The  method  of  choosing  a  new  processing  signal  presented  above  will 
be  applied  recursively  to  increase  the  value  of  the  cost  function,  which 
in  turn  implies  that  the  expected  value  of  the  matched  filter  receiver 
will  increase.  Ip  terms  of  the  processing  signal,  this  means  it  will 
more  closely  match  the  form  of  the  signal  at  the  output  of  the  channel 
whose  scattering  function  is  t,  $). 

The  cost  functional,  defined  as  the  expected  value  of  the  matched 
filter  receiver  output  to  the  channel  output,  given  by 


J(g)  ■  //  |xx>g(T,1»)|2  Rschn(t»<(,)  dTd^  » 

will  be  increased  recursively  using  the  following  procedure: 

1.  Choose  a  transmit  signal  x(t)  and  an  initial 
processing  signal 

2.  Calculate  the  initial  uncertainty  function  Xx,gj/ T> 

3.  Calculate  the  gradient  function  given  by 


(4.4-10) 


00 

MO  -  //  X*  „  <T,$)  R„  (T,40  X(t-T)  2ir^(t— x/2)  dTd^ 

1  (4.4-11) 


Form  the  new  processing  signal 


gi+i(b) 


gt(t)  +  y  4>.(t) 


(4.4-12) 


Calculate  the  uncertainty  function  Xx,gi+^  ( t»  <P)  • 

Calculate  the  cost  functional  J(gi+i)  • 

Return  to  step  3  or  stop  if  a  maximum  amount  of  interations 
have  been  reached  or  the  value  of  J(gi+i)  equals  or  exceeds  a 
predetermined  threshold. 


In  general,  there  is  no  way  to  perform  each  step  in  closed  form. 


particular,  closed  form  calculation  of  the  cross-ambiguity  or  gradient 
functions  presents  an  unwieldy  task.  Therefore,  all  the  equations  must 
be  discretized  and  the  optimization  procedure  done  numerically. 


4.4.3  Convergence  of  the  Gradient  Projection  Algorithm 

Determining  the  convergence  of  the  gradient  projection  algorithm 

presented  in  section  4.4.2  is  done  by  examining  the  behavior  of  the 

increment  of  the  cost  functional.  By  showing  that  the  increment  is 

always  a  non-zero  positive  value  for  any  non-zero  positive  step  size  Y 

00 

will  imply  that  the  sequence  {J(g^))n_^  monotonically  increases  and  will 
converge  because  it  is  bounded. 

To  begin,  it  is  necessary  to  determine  the  processing  signal 
increment  for  the  i-th  step  of  the  recursive  algorithm  which  by  (4.4-9) 
is 


Sg^t)  = 


gi(t)  +  y  ^(t) 


lgi+Y*ill2 


"  g^t). 


(4.4-13) 


Substituting  (4.4-13)  into  (4.4-1)  gives  the  increment  of  the  cost 
functional  at  the  i-th  iteration  as 


AJ(gi.figi)  =  2  Re{<5gi,Y'J|i>}  +  •J(<5gi) 


2  Re{<gi,$i>+<$.,4’.>Y} 


»gi+Y$i»2 


-  2  Re{<gi,0.>}  +  J(Sg.). 


(4.4-14) 


By  property  3  of  the  gradient  function,  the  numerator  in  (4.4-14)  can  be 
restated  as 


2  Re  {<g1$i>+<$i ,  '*’i>Y>  -  2  Re(J(g1)+l*ll2Y} 

-  2  [J(g  )  +  I*  II?  Y]. 


(4.4-15) 


Because  J(g^),  11  *&!. 11 2  *  an<*  ^  are  a^-  real  numbers  the  operation  of 
finding  the  real  part  of  the  complex  number  is  not  required  in 
(4.4-15).  The  denominator  of  the  first  term  on  the  right  side  of 
(4.4-14)  can  also  be  simplified  using  (2.2-8),  property  3  of  the  gradient 
function,  and  the  fact  that  gi(t)  is  a  unit  energy  signal  as  follows: 

Hgi+T^i  "2  =  <gi+y&i.g1+Y‘S>i> 

*  <gi»gi>  +  2  Re{<gi,$i>}  y  +  <$1,<ti>  Y2 

*1+2  J(gi)  y  +  »2  Y2.  (4.4-16) 

The-efore,  by  (4.4-15)  and  (4.4-16),  the  increment  of  the  cost  functional 
is 

2  [J(g  )  +  ■*  82  y] 

AJ(g1,6g1)  - - - - - -  - -  2  J(g.)  +  J(6gi).  (4.4-17) 

/I  +  2  J(gt)  y  +  Y2 

Since  J(6g^)  is  always  a  positive  value,  it  is  only  necessary  to 
determine  if  the  sum  of  the  first  two  terms  of  the  right  side  of 
(4.4-17),  the  first  Gateaux  derivative  of  J(gi),  is  always  positive.  By 
examination  of  (4.4-17),  it  is  seen  that  the  first  Gateaux  derivative  is 
a  function  of  the  step  size  Y»  therefore, 

,  2  [J(g  )  +  I®  II2  y] 

F(y)  =  6J(gi,<5gi)  =  ■  -  - - - - -  2  J(gt).  (4.4-18) 

/I  +  2  J(gi)  Y  +  il2  Y2 

The  function  F(y)  is  continuous  for  all  y  >  0  and  can  be  shown  to  be 
monotonic  by  examining  its  first  derivative.  Since  differentiating 
(4.4-18)  involves  considerable  algebraic  manipulation,  the  details  are 
omitted  here,  and  only  the  result  stated,  which  is 
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dF(Y)  = 
dY 

2  [ « <t>i  « 2  -  d(g.)2]  Hgi+Y4>i  "2 

1^*2  Y4  +  4  J(gj_)  « ^  «|  Y3  +  2  [11$.  II2  +  2  J(gi)2]  Y2  +  4  J(g.)  Y  +  1 

(4.4-19) 

The  denominator  of  (4.4-19)  is  a  polynomial  in  y  with  positive 
coefficients  because  both  J(gi)  and  II II 2  are  positive;  therefore,  for 
all  Y  >  0  its  value  is  always  greater  than  or  equal  to  1.  As  for  the 
numerator,  it  is  always  non-negative  since  the  norm  term  is  always 
positive,  and  because  by  property  4  of  the  gradient  function, 

>  J(g1)2.  (4.4-20) 

Therefore,  for  all  Y  >  0,  the  first  derivative  of  F(y)  is  non-negative. 
From  elementary  calculus  it  is  known  that  if  the  first  derivative  of  a 
function  is  non-negative  on  an  interval,  then  the  function  monotonically 
increases  on  that  interval;  consequently,  F(y),  or  the  first  Gateaux 
derivative  6J(g^ , 6g^) ,  increases  monotonically  for  Y  in  [0,°°).  If  the 
step  Y  is  zero,  then  from  (4.4-13)  and  (4.4-18) 

SJ(git5gi)  =  6J(gi,0)  =  F(0)  =  0  .  (4.4-21) 

Thus  F(y)  =*  dj(g^ ,  5gj_)  is  non-zero  for  all  Y  >  0.  It  now  follows  that 
because  the  second  Gateaux  derivative  is  always  non-negative,  then 

AJ(gi,5gi)  >0  for  all  y  >  0  .  (4.4-22) 

This  in  turn  implies 

J(gi+1)  >  for  a11  Y  >  °.  (4.4-23) 

00 

From  (4.4-23)  it  is  guaranteed  that  the  sequence 
monotonically  increases  if  y  >  0.  Furthermore,  the  sequence  is  bounded 
from  above  by  (3.3-6)  giving  J(g^)  <  l.  From  elementary  calculus,  it  is 
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known  that  a  bounded  monotonic  sequence  of  real  numbers  is  convergent. 
Thus , 


lim  J(gi)  =  J  <  1  .  (4.4-24) 

+oo  r 


The  analysis  presented  above  proves  that  the  gradient  projection 
algorithm  will  increase  the  expected  matched  filter  output.  It  should  be 
noted  that  it  does  not  guarantee  that  the  sequence  of  processing  signals, 

<30 

{g^)n!^,  converges.  However,  what  is  guaranteed  is  that  each  successive 
gi(t)  produces  a  better  average  receiver  output.  This  extends  from  the 
fact  that  the  cross-ambiguity  function  is  not  unique  for  a  given  pair  of 
transmit  and  processing  signals.  Since  the  value  of  the  expected  matched 
filter  output,  the  cost  functional 


<30 

J(gi)  "  // 

— <30 


|x  _  (T,<j>)|2  R  C  T,  d>)  dTd<j> 
^’gi  bCHN 


(4.4-25) 


is  related  to  g^(t)  via  the  cross-ambiguity  function,  the  value  of  J(gi)  is 
not  unique  for  a  given  g^(t).  It  should  be  realized  that  this  does  not 
invalidate  the  optimization  procedure  presented  in  this  chapter  since  it 
does  insure  an  increase  in  J(g^).  Therefore,  the  nonuniqueness  of  each 
gi ( t )  in  the  optimization  sequence  is  of  little  consequence. 


4. 5  Examples  of  Matched  Filter  Optimization 

Since  the  optimization  procedure  presented  in  section  4.4.2  must  be 
implemented  on  a  digital  computer,  it  is  necessary  to  discretize  the 
equations  used  in  the  procedure. 

Consider  the  uncertainty  function  derived  in  Chapter  3  and  given  by 
00 

^(*,40  =  I  x(t— t/2)  g*(t+x/2)  e''2lT(*>t 


dt  . 


(3.2-37) 
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By  invoking  a  change  of  variable  and  defining  the  function 
v(t,4>)  =  g*(t)  e3 ^TT^t  ^ 


(4.5-1) 


equation  (3.2-37)  can  be  restated  as 


Y  (x,<fr)  =  /  x(t-x)  v(t,<|>)  dt  .  (4.5-2) 

*»o  _oo 

The  integral  in  (4.5-2)  can  be  approximated  by  a  sum  by  setting  tf>  ®  l 
(A4>),  t  =  m  (At),  and  x  =  n  (At),  and  by  replacing  the  integral  sign  by  a 
summation  symbol  giving 


X  (n(Ax),*(A<j>)) 

*  >  a 


e-j7rnA(Ai}>)(AT)  ^ 
m=l 


x(m(At)-n(  At))  v(m(  At)  ,  £(  A4>) )  At. 

(4.5-3) 


A  simplification  can  be  made  by  setting  At  =  At,  thus  allowing  the 
definition  of  the  discrete  sequences 

x(m(At)-n(Ax))  «  x((m-n)(At)) 

=  x(m-n)  (4.5-4) 

v(m(At)  ,  £(  A<j>))  =  v(m(AT),  £(A<)>)) 

=  v(m, l)  .  (4.5-5) 

Substituting  (4.5-4)  and  (4.5-5)  back  into  (4.5-3)  gives  for  the 
discretized  uncertainty  function 

y  (n(Ar),  £(  A<p) )  =  At  e  3 1Tnl(AT)  (  A4>)  £  x(m-n)v(m,  l) .  (4.5-6) 

^’g  o-l 


The  summation  in  (4.5-6)  is  actually  a  correlation  between  the 
sequence  x(n)  and  v(n,&),  so  it  can  be  calculated  using  discrete  Fourier 
transforms. Therefore,  it  can  be  shown  that  (4.5-6)  can  be  written  as 


(n( At)  ,  £(  A<j>)  )  =  At  e~3  A't') C  Ax)  IDFT{DFT(x(_n))  DFT(v(n,  A))  } 

(4.5-7) 


where 


N-l 


DFT(x(n))  =  £  x(n)  e  J^nkn/N  =  x(k) 

n=0 


(4.5-8) 


is  the  forward  discrete  Fourier  transform  of  the  sequence  x(n)  and 


IDFT(X(k) )  =  i  l  X(k)  ej2llkn/N 
k=0 


(4.5-9) 


is  the  inverse  discrete  Fourier  transform  of  the  sequence  of  Fourier 
coefficients  X(k). 

Another  equation  that  requires  discretization  is  the  formula  for  the 
gradient  function  given  by 


*(t) 


//  XJ, 

_a> 


UA) 


Rq  (T, 

CHN 


<j>)  x(t-T)  e 


j2TT<t>(t-T/2) 


d  Td  (j>. 

(4.5-10) 


By  defining  the  functions 


Q( T,  d>)  =  **  (  t,  <#►)  R_  (  T,  4>)  e 

X,°  CHN 

w(t,4>)  =  x*(t)  e-’27T^t 


equation  (4.5-10)  can  be  restated  as 
00 

$(t)  =  //  9( T,  <J»)  w(t-T,(j))  d Td ( 


-jTT^T 


/ 


J  9(  T ,  4»  )  W(t-T,<j>)  dT 


d<t 


/  9(t,<{i)*w(t,^)  d 4>  • 


(4.5-11) 

(4.5-12) 


(4.5-13) 


As  before,  the  integral  in  (4.5-13)  can  be  approximated  by  a  sum  by 
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setting  <(>  =  i  (A4>)  and  t  =  n  (At)  and  by  replacing  the  integral  sign  by  a 
summation  symbol  giving 

$(n(AT))  =  1  [0(n(At)f  £(  A<{>)  )*x(n(  At)  ,  £(  A<{>))  At]  A<j>  .  (4.5-14) 

l 

By  defining  the  two-dimensional  sequences 


e(n(AT),£(A<]>))  =  9(n,£), 
w(n(AT),  A(  A4>) )  =w(n,2), 
$(n( At) )  ^  *(n), 


(4.5-15) 

(4.5-16) 

(4.5-17) 


and  substituting  them  into  (4.5-14)  gives 


3(n)  =  (At)  (A<Ji)  l  9(n,£)*w(n ,  l) , 
l 


(4.5-18) 


where  the  discrete  convolution  inside  the  sum  of  (4.5-18)  is  with  respect 
to  the  variable  n.  Again  the  convolution  can  be  calculated  using 
discrete  Fourier  transform,  therefore 


$(n)  =  (At)  ( A<j>)  [  ID  FT  (DFT(  0(n,  Z) )  DFT(w(n,  l) ) }  . 

I 


(4.5-19) 


Now  that  the  equations  giving  the  uncertainty  function  and  gradient 
function  are  in  a  useable  form,  the  gradient  projection  algorithm  can  now 
be  stated  in  a  form  suitable  for  implementation  on  digital  computer: 

1.  Choose  a  discretized  transmit  signal  x(n)  and  an  initial 
discretized  processing  signal  g^(n). 

2.  Calculate  the  initial  discretized  uncertainty  function 

X,  „  (n,£). 

x,gL 

3.  Calculate  0j.(n,£)  and  wj_(n,&). 

A 

4.  Calculate  the  gradient  function  ^(n)  using  (4.5-19). 

5.  Form  the  new  discretized  processing  signal 
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si+lCn) 


g^(n)  +  Y  $t(n) 


At  /  l  J  g±  (n  )+y 5^^  (n )  | 


(4.4-20) 


6.  Calculate  the  discretized  uncertainty  function  x 


(n, Z). 


x,g 


i+1 


7.  Calculate  the  value  of  the  cost  functional  using 

J.+1  =  (At)  ( A<j>)  l  l  |X_  (n,£)|2  Rg  (n ,1) 

n  £  x»§i+l  TGT 


(4.4-21) 


where 


R  (n,i)  =  R  (n(  At)  ,  £(  A4>) ) .  (4.5-22) 

CHN  CHN 

8.  Return  to  step  3  or  stop  if  a  maximum  amount  of  iterations 
have  been  reached  or  the  value  of  equals  or  exceeds  a 

predetermined  threshold. 

The  version  of  the  gradient  projection  algorithm  given  above  was 
implemented  on  the  VAX1 1/782  at  the  Applied  Research  Laboratory.  The 
algorithm  was  written  as  a  FORTRAN  77  program,  was  run  in  a  low  priority 
batch  queue,  and  for  these  examples  required  about  one  hour  of  CPU  time. 

The  transmit  signal  and  initial  processing  signal  are  both  analytic 
signals  with  ten  percent  raised  cosine  windows.  Both  were  linear 
frequency  modulated  with  the  transmit  signal  being  a  200  Hz  upchirp  and 
the  processing  signal  a  200  Hz  downchirp.  Figures  12  and  13  show  the 
magnitude  and  the  real  and  imaginary  parts  of  the  envelopes  of  both 
signals.  Figure  14  shows  the  initial  cross-ambiguity  function  derived 
from  these  two  signals.  Figure  15  shows  the  scattering  function  which 
consists  of  three  two-dimensional  Guassian  pulses  each  with  oT  =  0.015 
sec.  and  a ^  =  25.0  Hz.  One  pulse  is  centered  on  the  t  axis  at  t  =  0.065 
sec.  and  the  remaining  two  pulses  are  centered  at  x  =  0.155  sec.  and  <}>  = 


-  75  Hz.  Furthermore,  the  pulses  have  been  scaled  so  that  the  volume  of 
the  scattering  function  (the  ratio  of  the  average  return  energy  to  the 
transmit  energy)  is  0.2.  For  the  discretization  of  all  functions,  the 
values  At  =  At  =  0.00025  sec  and  A$  ■  10  Hz  were  used  and  the  step  size 
was  set  at  y  =*  5.0. 

The  program  iterated  15  times,  and  an  increase  in  the  cost  functional 
occurred  at  each  iteration.  This  is  shown  in  Figure  16.  Figure  17 
shows  the  magnitude  and  real  and  imaginary  parts  of  the  final  processing 
signal  and  Figure  18  shows  the  cross-ambiguity  function  derived  from  the 
transmit  signal  and  final  processing  signal. 

By  comparing  Figures  14  and  18,  it  can  be  seen  that  the  volume  of  the 
cross-ambiguity  function  has  been  redistributed  from  one  large  plateau 
into  two  ridges.  Each  of  these  ridges  subtends  nearly  the  same  region  in 
the  ( t— d> )  plane  as  do  the  three  Gaussian  pulses  of  the  scattering 
function;  in  fact,  by  close  inspection  of  large  ridge  it  can  be  seen  that 
the  ends  of  the  ridge  have  assumed  a  shape  similar  to  two  of  the  pulses 
in  the  scattering  function.  What  occurred  during  the  optimization 
procedure  is  that  the  processing  signal  was  altered  to  cause  the  volume 
of  the  resulting  cross-ambiguity  function  to  collect  in  the  same 
locations  as  the  pulses  in  the  scattering  function  and  if  possible  cause 
the  cross-ambiguity  function  to  assume  the  same  shape.  This  follows  the 
principle  developed  in  section  3.3,  which  said  that  the  expected  matched 
filter  receiver  output  is  maximized  if  the  cross-ambiguity  function  is 
proportional  to  the  scattering  function  of  the  channel.  In  this  case, 
however,  the  shape  of  the  cross-ambiguity  function  never  actually  matched 
that  of  the  scattering  function.  This  occurred  because  the  processing 
signal  was  only  parameter  altered  during  the  optimization  process. 
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Figure  16.  The  value  of  the  cost  functional  at  each  iteration 
of  the  projection  algorithm  for  the  first  example. 


It  can  also  be  demonstrated  that  the  processing  signal  generated  by 
the  gradiant  projection  algorithm  is  highly  dependent  upon  the  placement 
of  the  initial  cross-ambiguity  function  in  the  plane  and  upon  the 

distribution  of  its  volume.  Figure  19  shows  a  cross-ambiguity  function 
derived  from  an  0.012  sec.  transmit  signal  and  an  0.210  sec.  processing 
signal.  Both  signals  were  contimuous  wave  tones  with  a  ten  percent 
raised  cosine  window.  The  scattering  function  used  was  the  same  one  used 
in  the  first  example  and  is  shown  in  Figure  15.  By  examining  Figures  15 
and  19,  it  can  be  seen  that  the  two  high  doppler  pulses  of  the  scattering 
function,  located  at  t  =  0.155  sec.  and  $  =  ±  75  Hz.,  are  not  fully 
subtended  by  the  cross-ambiguity  function.  Furthermore,  the 
cross-ambiguity  function  does  not  have  a  large  portion  of  its  volume  in 
the  region  of  the  plane  where  the  high  doppler  pulses  and  the 

cross-ambiguity  function  intersect. 

As  in  the  first  example,  the  computer  program  iterated  15  times,  and 
an  increase  in  the  cost  functional  occured  at  each  iteration.  This  is 
shown  in  Figure  20.  Figure  21  shows  the  final  cross-ambiguity  function. 
It  can  be  seen  that  the  optimization  procedure  placed  most  of  the  volume 
of  the  cross-ambiguity  function  at  the  pulse  of  the  scattering  funcion 
located  at  t  =  0.065  sec.  and  $  =  0  Hz.  This  is  also  the  same  pulse  of 
the  scattering  function  that  was  completely  overlayed  by  the  initial 
cross-ambiguity  function  and  where  it  had  a  significant  portion  of  its 
volume.  Thus,  the  gradient  projection  algorithm  tends  to  place  the 
volume  of  the  cross-ambigutiy  function  over  the  prominent  portions  of  the 
scattering  function  that  are  best  overlayed  by  the  initial 
cross-ambiguity  function. 
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Another  way  to  test  the  validity  of  the  gradient  projection 
algorithm  is  to  have  it  produce  a  known  result.  Figure  22  shows  a 
scattering  function  that  clos  \v  models  a  point  scatterer.  It  is  a 
single  gaussian  pulse  located  at  t  =  0.15  sec.  and  <j>  =  0  Hz.  with 
standard  deviations  of  aT  =  0.002  sec.  and  =  2.0  Hz.  The  transmit 
signal  and  initial  processing  signal  are  the  same  used  in  the  first 
example  where  the  magnitude  and  real  and  imaginary  parts  of  their 
envelopes  are  shown  in  Figures  12  and  13.  The  initial  cross-ambiguity 
function  is  shown  in  Figure  14.  The  opimization  procedure  iterated  7 
times,  and  the  final  cross-ambiguity  function  is  shown  in  Figure  23.  It 
is  an  autoambiguity  function,  which  is  a  cross-ambiguity  function  derived 
from  equal  transmit  and  processing  signals.  In  effect,  the  gradient 
projection  algorithm  dechirped  the  200  Hz.  downchirp  FM  processing  signal 
to  a  200  Hz.  upchirp  FM.  It  is  shown  in  Van-Trees ^  and  is  well  known  in 
the  literature  that  to  optimally  detect  a  point  scatterer  in  the  presence 
of  white  gaussian  noise  both  the  transmit  and  processing  signals  must  be 
equal  (this  gives  rise  to  the  autoambiguity  function). 
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CHAPTER  5 

SUMMARY  AND  CONCLUSIONS 

The  derivation  of  the  scattering  function  and  its  three  Fourier 
transforms,  under  the  assumption  that  it  describes  a  wide  sense 
stationary  uncorrelated  system  (WSSUS),  has  been  presented.  Also  it  was 
shown  that  the  scattering  function  can  be  used  to  determine  the  expected 
output  of  a  matched  filter  receiver  by  multiplying  it  with  the 
cross-ambiguity  function  and  integrating  the  resulting  two-dimensional 
function.  Finally,  a  method  for  optimizing  the  matched  filter  by 
altering  the  processing  signal  was  developed,  and  a  numerical  example  was 
given. 

It  should  be  pointed  out  that  the  cross-ambiguity  function  is  the 
bridge  chat  links  scattering  properties  via  the  scattering  function  to 
the  signals  used  in  detection.  It  is  for  this  reason  that  the 
cross-ambiguity  function  is  of  major  importance  in  the  design  of 
receivers  for  signal  detection  in  the  communication  systems.  There  are 
two  important  results  in  this  thesis  related  to  cross-ambiguity 
functions.  First,  it  was  shown  that  if  a  receiver  is  to  maximally  detect 
a  signal  at  the  channel  output,  then  the  cross-ambiguity  function  derived 
from  the  transmitted  signal  and  the  processing  signal  must  be 
proportional  to  the  channel  scattering  function.  Second,  a  method  was 
derived  to  iteratively  find  an  optimum  processing  signal  given  a  fixed 
transmit  signal  and  channel  scattering  function.  The  validity  of  the 
method  was  demonstrated  by  producing  a  known  result;  specifically,  when 


the  channel  was  modeled  by  a  point  scatterer,  the  cross-ambiguity 
function  converged  to  an  auto-ambiguity  function. 


There  are  a  number  of  extensions  to  this  work  that  could  be  made. 

For  example,  an  analysis  could  be  made  to  determine  the  rate  of 
convergence  of  the  gradient  projection  algorithm,  and  the  optimum  step 
size  could  be  determined  to  accelerate  the  optimization  procedure.  To  do 
this,  the  behavior  of  the  second  variation  of  the  cost  functional  with 
respect  to  the  step  size  would  have  to  be  examined.  Another  possible 
research  project  would  be  to  develop  a  method  of  jointly  altering  the 
transmit  and  processing  signals  with  a  global  search  to  produce  a  better 
match  between  the  cross-ambiguity  and  scattering  functions.  This  would 
be  a  generalization  of  the  problem  addressed  in  this  thesis.  It  was 
shown  that  the  optimized  processing  signal  depended  upon  how  well  the 
original  cross-ambiguity  function  overlayed  the  scattering  function  and 
to  what  extent  its  volume  was  placed  with  respect  to  the  scattering 
function  support.  As  a  result,  the  optimization  procedure  converged 
locally  over  the  whole  set  of  possible  transmit  and  processing  signal 
pairs.  The  analysis  of  a  procedure  where  both  signals  are  altered,  and 
the  cross-ambiguity  function  is  shifted  in  the  (r,<j>)  plane,  may  show  that 
it  is  possible  to  globally  maximize  the  expected  matched  filter  output 
over  all  admissible  transmit  and  processing  signals. 
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